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Abstract 



Understanding the dynamics of spatially extended systems represents a challenge 
in diverse scientific disciplines, ranging from physics and mathematics to the earth 
and climate sciences or the neurosciences. This challenge has stimulated the devel- 
opment of sophisticated data analysis approaches adopting concepts from network 
theory: systems are considered to be composed of subsystems (nodes) which inter- 
act with each other (represented by edges). In many studies, such complex networks 
of interactions have been derived from empirical time series for various spatially 
extended systems and have been repeatedly reported to possess the same, possi- 
bly desirable, properties (e.g. small-world characteristics and assortativity). In this 
thesis we study whether and how interaction networks are influenced by the anal- 
ysis methodology, i.e. by the way how empirical data is acquired (the spatial and 
temporal sampling of the dynamics) and how nodes and edges are derived from 
multivariate time series. Our modeling and numerical studies are complemented by 
field data analyses of brain activities that unfold on various spatial and temporal 
scales. We demonstrate that indications of small-world characteristics and assorta- 
tivity can already be expected due solely to the analysis methodology, irrespective 
of the actual interaction structure of the system. We develop and discuss strate- 
gies to distinguish the properties of interaction networks related to the dynamics 
from those spuriously induced by the analysis methodology. We show how these 
strategies can help to avoid misinterpretations when investigating the dynamics of 
spatially extended systems. 
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1 Introduction 



We live in a world where complex systems are all around us. Understanding, pre- 
dicting, and controlling their dynamics lies at the heart of many of today's global 
challenges, ranging from climate change, global population growth, decrease in bio- 
diversity, spread of infectious diseases, to the global financial crisis at the beginning 
of the 21^* century To meet these challenges and in order to extend our knowledge 
of the world around us, complex systems are studied in various sciences, includ- 
ing physics, mathematics, climate and earth science, quantitative finance, biology, 
medicine and the neurosciences. Breaking down complex systems into their con- 
stituents which are then separately studied has been proven to be a very successful 
approach in the past. However, complex systems can display properties as a whole 
which are not present on the level of single constituents. Thus, the next step towards 
a better understanding of such a system is based on studying its constituents (sub- 
systems) and taking into account their mutual interactions. This approach has been 
pursued in physics, where scientists have made remarkable advances in bridging 
the gap between the microscopic and the macroscopic features of systems (e.g., in 
statistical mechanics). 

During the last decade, research into the dynamics of complex systems has 
adopted and advanced concepts from network theory |jT|]. The rapid propagation of 
network-theoretic ideas in various disciplines such as physics [1-9], biology IIT0] - [T3l , 
sociology IIT414T8B , and the neurosciences l|T9l - l27ll reflects the insight that many nat- 
ural systems can be understood as networks of interacting constituents. The suc- 
cess of network approaches also becomes noticeable in a growing number of more 
specialized reviews recently published in the physics and mathematics literature 
(see reviews focussing on synchronization and critical phenomena [81191, spatial 
networks Il28l , community structure ll29ll30B , edge prediction Il3ll , semantic net- 
works 13211 , random processes on networks 1133113411 ). From the network perspective, 
properties of the dynamics of a complex system are reflected in the topology of an 
interaction network (also called functional network) whose nodes represent subsys- 
tems and whose edges represent interactions between them. In contrast, edges of a 
structural network represent physical connections between subsystems of a natural 
system (e.g., synaptic connections between neurons in the brain). Structural net- 
works serve as the physical substrate of the dynamical patterns observed in interac- 
tion networks. The intricate interrelationships between the dynamics of subsystems. 
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their physical connectivity, and the dynamical patterns displayed by the whole sys- 
tem (i.e., structure-function relationships) are subjects of ongoing research activi- 
ties, including modeling and field studies. 

In field studies, interaction networks are derived from empirical data. The data 
usually consists of a number of time series, each of which is obtained with a sensor 
that is placed so as to efficiently capture the dynamics of a subsystem. Most interac- 
tion networks are derived by associating each sensor with a node, and the inference 
of edges is based on estimates of signal interdependencies between pairs of time 
series (e.g., the Pearson correlation coefficient). Based on this approach, interaction 
networks of various spatially extended systems have been derived and studied. For 
instance, climate networks derived from physical observables such as temperature 
or pressure revealed richly structured topologies indicating the presence of com- 
munities, connections between geographically very distant nodes (teleconnections), 
or properties reflecting the El Nino-Southern Oscillation climate pattern (see, for 
example, references Il35] - l37l '). Moreover, climate networks may turn out to be a use- 
ful tool to investigate the stability of the climate system and the impact of global 
warming (see references II381I39II and references therein). Seismic networks are de- 
rived from time series of the physical observables of earthquake dynamics (see 
references ||40] - |44| and references therein for different approaches towards network 
construction). Some of the findings reported so far indicate that main shocks are re- 
flected in central nodes (also called "hubs", i.e., nodes with more edges than most of 
the other nodes) |i42ll44|| , that long-range connections might reflect large geological 
faults (which transfer stresses) [44J, and that seismic networks may help to iden- 
tify triggered earthquakes Il44ll . In the neurosciences, functional brain networks are 
typically derived from time series obtained via electrophysiological or neuroimag- 
ing techniques such as electroencephalography (EEG), magnetoencephalography 
(MEG), or functional magnetic resonance imaging (fMRI) (see reviews IIT91 - I27II for 
an overview). Network characteristics were reported to reflect physiological pro- 
cesses such as aging [45J, cognitive performance 1146 [|47| , and sleep Il48] - l50l , to be — 
to some extent — heritable IISTI , and also to change in pathological conditions like 
Alzheimer's disease Il52ll53ll , schizophrenia Il54l - l56l , or epilepsy Il57l - |63ll . These find- 
ings indicate that network characteristics may prove useful as diagnostic markers 
for mental and neurological disorders and that the mechanisms causing brain dis- 
orders may be better understood from a network perspective, possibly driving the 
development of novel treatment strategies. 
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Although the aforementioned complex systems differ in types of subsystems and 
interactions, they were reported to share striking features on the level of their inter- 
action networks, a finding which may point — as hypothesized by many research- 
ers — towards a universal organization principle of dynamical systems. For instance, 
seismic EZUll, climate BSSHlHUll, and functional brain networks ||19H20H2l have 
all been repeatedly reported to possess small-world topologies. Such networks dis- 
play strong local connectivity and possess long-range connections (as characterized 
by the network metrics clustering coefficient and average shortest path length). More 
recently, studies of seismic ||43| and brain functional networks 16514711 revealed that 
edges of interaction networks preferentially connect nodes with a similar number 
of edges, a feature called assortativity. Both network characteristics — small-world 
topology and assortativity — have been shown in numerical studies to support the 
resilience of a network to random failures or targeted attacks (removal of some 
nodes or edges). In addition, small- world topologies allow for an efficient transport 
of information, masses, or other entities throughout the network. While resilience 
and efficient transport are desirable features from a biological perspective, where 
evolutionary selection pressures may have shaped the physical substrate of interac- 
tion networks, the interpretation of these findings for non-biological systems is not 
yet quite clear. 

A key challenge when analyzing empirical interaction networks is to reliably as- 
sess whether findings are significant or not, i.e., whether they reflect characteristics 
of the dynamics of the system under study. Such an assessment can pave the way to- 
wards a deeper understanding of the dynamics and is an inevitable prerequisite for 
the interpretation of analysis results and the development of further research strate- 
gies. A common way to establish significance of findings is based on a comparison 
of features of interaction networks with those found in ensembles of random net- 
works BU. If features differ (e.g., according to some statistical test), the finding is 
called significant. In this context, the chosen random network ensemble encodes an 
expectation of what can be assumed to be present "by chance". The vast majority 
of network studies makes use of the very same random network models, regardless 
of whether nodes represent entities embedded in space (e.g., airline networks) or 
not (network of scientific citations), regardless of whether edges represent static re- 
lations (e.g., the physical connections of an electric power grid) or reflect dynamic 
interactions unfolding on certain temporal scales (interactions between neurons), 
and regardless of the actual acquisition of the data which may also be subject to 
various constraints. In the case of spatially extended dynamical systems, the in- 
ference of interaction networks relies on the spatial and temporal sampling of the 
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dynamics inevitably yielding a limited amount of data. Whether and how the way 
empirical data is acquired and interaction networks are derived from time series 
influence properties of interaction networks and the assessment of significance is 
largely unknown. 

In this thesis, we investigate whether and how the spatial and temporal sam- 
pling of spatially extended dynamical systems together with commonly applied 
methods for edge inference influence the topological properties of interaction net- 
works derived from multivariate time series. Moreover, we develop and propose 
strategies which can help to distinguish properties of interaction networks related 
to the dynamics from those spuriously induced by the identified influences. The 
investigations performed involve modeling and numerical studies, as well as field 
data analyses. All these studies are designed, carried out, and interpreted with 
respect to the perspective of researchers who face the challenge of acquiring and 
analyzing data of complex systems. The majority of the presented studies focus on 
small-world characteristics and assortativity as the former have been frequently as- 
sessed in field studies and the latter receives growing attention IlilZZl. To examine 
whether and to what extent findings obtained in modeling and numerical stud- 
ies carry over to field data studies, interaction networks derived from the human 
brain — a prime example of a spatially extended dynamical system whose dynamics 
lives on various spatial and temporal scales — are investigated with respect to spa- 
tial and temporal sampling. These interaction networks are obtained from healthy 
subjects as well as from epilepsy patients. The latter could particularly benefit from 
a better understanding of the disease epilepsy with its most prominent dynamic 
feature: recurring and in many cases uncontrollable epileptic seizures. 

This thesis is organized as follows. In chapter |2l, concepts in the context of interac- 
tion networks are delineated and notation is introduced. To illustrate the network 
approach, exemplary field studies of brain functional networks are presented in 
chapter |3] and their findings are briefly discussed, which shapes the strategy pur- 
sued in the following investigations. The subsequent chapters are devoted to inves- 
tigations of the impact of the spatial sampling (chapter H]) and the impact of the 
temporal sampling (chapter [S]) on properties of interaction networks derived from 
the dynamics of complex systems. Each of these chapters includes an in-depth dis- 
cussion of the findings and possible ways to approach the identified challenges. 
Finally, in chapter [61, the key results of this thesis are summarized, their potential 
impact on other areas of research are discussed, and possible further directions of 
research are outlined. 
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An interaction network is a means to characterize the dynamics of a system. Nodes 
represent subsystems which interact (represented by an edge) or not (no edge) with 
each other. For the inference of interaction networks and for the interpretation of 
their properties, we recall basic definitions, focus on few but important concepts in 
graph theory (section 12. 1[) and time series analysis (section I2.2[) , and introduce the 
notation used in this thesis. 



2.1 Network basics 

A complex network can be studied using concepts from graph theory in which it is 
represented as a graph. An unweighted graph is defined by a non-empty set of nodes 
and a set E of unordered (or ordered) pairs of elements of the set of nodes BU. 
E represents the set of edges connecting the nodes of the undirected (or directed) 
graph. Let N denote the number of nodes, which is also known as the size of the 
graph^ |l6J. A graph is said to have finite size if N < oo. A node i is said to be a 
neighbour of node ; if there is an edge e G E connecting i and A weighted graph 
can be defined by adding a set of values to the sets of nodes and edges. These 
values are usually real numbers and represent weights attached to the edges. Note 
that — unless otherwise stated — we will consider unweighted undirected graphs in 
this thesis, and we will use the notions graph and network interchangeably in the 
following. 

A graph of size N can be represented by a N x N square matrix A., the adjacency 
matrix. For unweighted undirected graphs, entries Aij = Aji of A. indicate whether 
an edge between nodes i and ; exists (Aij = 1) or not (Aij = 0). Adjacency matrices 
of undirected graphs are symmetric, while those of directed graphs are typically 
not. In accordance with the majority of the mathematics or physics literature on 
networks, we do not account for self -connections of nodes, and thus, by definition. 
An = OVz. Weighted graphs can be described by a N x N square matrix W, the 
weight matrix (W;y represents the weight of the edge between i and /). 



^ This is just one example demonstrating the different use of terms in physics and mathematics. In 
the mathematics literature, the size of a graph is the number of edges while the order of a graph 
corresponds to the number of nodes. We will stick to the notations used in physics throughout 
this thesis. 
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Figure 2.1: Sketch of an exemplary network with N = 10 nodes (represented as 
circles) and |E| = 11 edges (black lines). The network is unweighted, undirected, 
and connected. We consider two exemplary nodes i and /. Their degrees are fc/ = 4 
and kj = 2 and both are connected by a shortest path of length /,y = 3. Their 
local clustering coefficients are = and Cj = 1. The mean degree of the network 
amounts tok = 2.2. The edge density can be determined by e = 2\E\/ {N{N — 1)) = 
k/{N-l) ^ 0.24. 

A path from node i to / is a sequence of neighbouring nodes which begins with i 
and ends with / and in which no node is contained more than once BH. The number 
of edges contained in the path is also known as the path length, and a path is said to 
be finite if its length is finite. Different paths may exist between nodes i and /, and 
the paths with the minimum length are known as shortest paths. The length of the 
shortest path between i and ; is denoted by lij (cf . figure 12. 1|) . A network is said to 
be connected if a finite path exists between every pair of distinct nodes i and ; of the 
graph; otherwise, the graph is said to be unconnected or disconnected. A component is 
a subset of nodes and a subset of edges of the graph precisely containing the edges 
that also appear in the graph over the same set of nodes. A component is said to 
be connected if there exists a finite path between every pair of distinct nodes of the 
component. The number of connected components is denoted as Nc, and we will 
also regard the special case of a single disconnected node as a component of the 
graph. 

An important notion in graph theory is the degree ki of a node i, defined as the 
number of neighbours of z, 

fcf = EA7. (2.1) 

i 
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A list of the degrees of all nodes is called the degree sequence in the physics liter- 
ature Closely related to the degrees of nodes is the notion of the degree dis- 
tribution which is considered as one of the most basic characterizations of a graph. 
The degree distribution p{k) — also denoted by p^t — is defined as the probability 
that a node chosen uniformly at random has degree k [|6j]. Equivalently, p{k) is the 
fraction of nodes of the graph possessing degree k. The first moment of the degree 
distribution is known as the mean degree k of the network, 

k = N-^Y;^k,. (2.2) 

i 

Related to the mean degree is the edge density, e = k/{N — 1), which corresponds 
to the number of edges of the graph divided by the number of all possible edges. 

Often one observes that nodes show a tendency to connect to nodes with similar 
or dissimilar degrees, also known as degree-degree correlations. Such a behaviour 
can, for instance, be studied by determining the two-point conditional probability 
p{k'\k) that a neighbour of a node with degree k has degree k' |5l. In other words, it 
is the probability that any edge from a node with degree k connects to a node with 
degree k' . We note that this concept can be extended to multi-point conditional 
probabilities p{k' ,k" , . . . ,k'^^^\k) that a node of degree k is connected to n nodes 
with corresponding degrees k' ,. . [IJ. A network is said to be uncorrelated if 

the degree of any node is independent of the degrees of its neighbours II73L i.e., 
the conditional probability does not depend on k. Note that uncorrelated networks 
may not always exist due to structural constraints related to the finite size of the 
network and its degree sequence (ll. To simplify the notation, we will also call 
networks uncorrelated which show degree-degree correlations due to structural 
constraints only. 

2.1.1 Network characteristics 

In the following, we present network characteristics which have been frequently 
used in numerical, theoretical, and in field data studies. 

Clustering coefficient. In many natural networks, it can be observed that if node 
i is connected to nodes / and m, then there is an increased probability that / and 
m are also connected to each other. This tendency, often referred to as clustering or 
transitivity (in the context of sociology Ulllll), is often associated with a robustness 
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of the network towards random removal of nodes and can be assessed by various 
methods. A prominent method is the clustering coefficient ||74| , 

-1 N 

C=nLCu (2.3) 

1=1 

which is the average of the local clustering coefficients of the network. The local 
clustering coefficient C, is defined as the fraction of the number of existing links 
between neighbours of i among all possible links between these neighbours [|5ll6ll74|, 

' \ 0, iffc,G{0,l}. 

Note that, by the definition of the adjacency matrix [6J, An = OVz, which ensures 
that Cf, C G [0, 1] . Various extensions and alternative definitions of the clustering 
coefficient have been proposed in order to allow for a characterization of weighted 
networks (see, e.g., references Il75]479ll ). Finally we mention that the transitivity of 
a network can also be characterized by the fraction of transitive triples defined as 
the fraction of connected triples of nodes which also form triangles [SlIH. While this 
definition is frequently used in sociology studies fMt] , the definition given in (|2.3|) 
and (|2.4[) is more common in numerical studies and field data analyses [j5]]. 



Average shortest path length. Different approaches can be pursued in order to 
characterize the efficiency of a network to transport information or other entities 
(depending on the type of network considered) between nodes. A prominent net- 
work characteristic based on the concept of shortest paths is the average shortest 
path length 0, 

t-S(FTT)|''" P-^* 

which has been investigated in many studies (see, e.g., chapter 2.2.2 in reference IISOll 
for a brief historical overview). Networks whose average shortest path length scales 
at most logarithmically with the number of nodes are said to possess the small- 
world property. Such networks have small average distances between nodes and are 
regarded as very efficient in terms of information transfer. The exact definition of 
the average shortest path length varies across the literature. We decided to include 
the distance from each node to itself (la = 0) in the average of equation (|2.5|) , as is 
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done in various studies. The exclusion, however, will just alter the value of L by a 
constant factor of (N + 1)/(N - 1) M- 

For disconnected networks, the above definition yields infinite values of the av- 
erage shortest path length since such networks possess nodes i and / for which 
no connecting path exists, and thus Ijj = oo. This is an issue for numerical stud- 
ies in which finite values of this network characteristic are preferred. Several ap- 
proaches have been pursued in order to overcome this issue. For instance, Ijj could 
be replaced by l^^ which leads to the definition of a network measure called ef- 
ficiency Il8ni82| . Another strategy followed in many studies is to exclude infinite 
values of lij from the average. We will adopt this approach in the following, which 
leads to the definition of the average shortest path length as 

^ = ^ E ^ii' (2-6) 

where 

S = {{hj) \kj<oo; z,; = l,...,N} (2.7) 

denotes the set of all pairs {i,j) of nodes with finite shortest path. Note that L — ^ 
for Nc — ^ N, i.e., for a network without edges. Finally we mention that the concept 
of the average shortest path length can be carried over to analyze weighted net- 
works. In this case, the shortest paths determined between nodes take the weight 
of edges into account ll23ll83ll84B. 

Assortativity coefficient. The tendency of nodes of a network to preferentially 
connect to other nodes with similar or dissimilar degree can be quantified in dif- 
ferent ways 11]. A prominent approach, which we will pursue in the following, is 
to evaluate the degree of nodes at either end of edges. Let e G E be an edge of the 
network, and let /g and nte denote the degrees of the nodes at either end of this 
edge. The assortativity coefficient ||72ll85i1 is then defined as 

a = corr(/,m), a G [-1,1], (2.8) 

where corr denotes the correlation coefficient determined between the degrees of 
nodes at either end of edges. We mention that a is not well defined for the spe- 
cial case of regular graphs, i.e., for networks whose nodes all have the same de- 
gree. Negative or positive values of a indicate dissortative or assortative mixing of 
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node-degrees (also referred to as degree-degree correlations), respectively. Networks 
displaying such types of mixing patterns are briefly called dissortative (sometimes: 
disassortative) or assortative networks. Networks which are neither assortative nor 
dissortative are said to be uncorrelated |[Tl|73l. An alternative concept proposed to 
assess degree-degree correlations is the evaluation the two-point conditional prob- 
ability p{k'\k) (see, e.g., reference l'86'l for a study based on empirical data). This 
approach, however, may be sensitively affected by statistical fluctuations if only 
short datasets are available for analysis [T|. To this respect, an approach based on 
the average degree of nearest neighbours seems to be more robust II871I88II . Exten- 
sions of the concept of assortativity have been proposed to quantify the assortativity 
of individual nodes (local assortativity coefficient 189]]) or to account for weighted 
and directed networks |]23]]75l]90B. 



Community structure (clusters). In many networks it can be observed that nodes 
are strongly interconnected within a group of nodes but only weakly or not con- 
nected with the rest of the network. The division of network nodes into such groups 
is called community structure |]9ll , and groups are interchangeably called communi- 
ties, clusters, or modules. The reliable identification of clusters is a challenge in dif- 
ferent scientific disciplines such as social sciences, earth sciences, engineering, life 
sciences, mathematics, and physics (see, e.g. references |]29l]92l]93l for an overview|§). 
Unfortunately, there is no generally accepted formal definition of a cluster, and 
many definitions are rather vague. Instead, clusters are often defined as the out- 
come of some algorithm without a precise a priori definition |]29l . The outcome of 
such algorithms is usually called partition or clustering (not to be confused with 
"clustering" in the context of the clustering coefficient). Methods usually need to 
deal with two challenges, namely to actually identify clusters and to determine 
the number of clusters justified by the data. Hierarchical methods produce a series 
of partitions with a varying number of clusters from which one has to choose, 
while non-hierarchical methods need the number of clusters to be specified prior to 
analysis. Each partition can be evaluated with various quality functions (see refer- 
ences II91U94|]95]1 ), and the partition with the number of clusters is chosen for which a 
quality function (for instance, the thoroughly studied modularity II911I96II ) obtains an 
extremum. Among the many methods available for identifying clusters, we choose 
a method [I971I98II from the domain of spectral clustering. The approach is detailed in 
section I7.1[ 

Reference 1291 pays special attention to contributions made by physicists and is close to our 
notation. 
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Interpretation of network characteristics. Values of network characteristics or the 
presence or absence of community structure are typically interpreted with respect 
to the ability of the network to transport information (or other entities, e.g. masses) 
and its resilience to random or targeted attack (or error), i.e., the removal of nodes 
or edges. Large values of the clustering coefficient are considered to be indicative of 
resilient networks. A removal of a node will most probably not prevent information 
transport between arbitrary nodes since parallel routes likely exist. Following the 
same line of reasoning, assortative networks are considered to be robust against 
attack since they possess a resilient core of connected high-degree nodes [i23il . This 
core, in addition, may facilitate the spread of information over the network. In 
contrast, dissortative networks are reported to be more chain-like, vulnerable, and 
fragile. Low values of the average shortest path length indicate that information 
can be exchanged between two arbitrary nodes by crossing just few edges. This 
property makes them very efficient in terms of information transfer. 

2.1.2 Network models 

Over the past decades, numerous network models have been developed and inves- 
tigated (see references lfTU6U28ll and references therein). Network models can help to 
improve our understanding of potential mechanisms shaping the topology of real 
networks. Moreover, they can be used as a means to implement null hypotheses 
when assessing the significance of properties found in real networks. For the lat- 
ter purpose, network models are commonly employed whose generation includes 
stochastic parts to various extent and obeys some chosen constraints. In the fol- 
lowing, we briefly present three network models and focus on some of the many 
findings which are of importance in the context of this thesis. 

Erdos-Renyi graphs. Considered as prototypical random networks, Erdos-Renyi 
graphs have been intensively studied in the mathematics literature Il99l - I103l and 
are easy to generate. They are used when lacking any information about the mech- 
anisms leading to the creation of edges. Two different models are referred to as 
Erdos-Renyi graphs. In the first model, edges are randomly created between dif- 
ferent nodes (avoiding multiple edges) until a fixed number of edges is reached 
II100H102I . In the second model, for each pair of nodes, an edge is created with 
probability < p < 1 19911 . Both models are closely related to each other and co- 
incide in the limit of large N taken at fixed k (see references in |6l). While the first 
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model has found frequent use in field studies, the second model is more frequently 
used in analytical considerations. We will use the second model throughout this 
thesi^ and with "Erdos-Renyi networks" we will refer to this second model from 
now on. 

By construction, edges in Erdos-Renyi graphs are equally likely and indepen- 
dently chosen to become edges. Hence, the degree of a given node has a Binomial 
distribution, i.e., the probability of a node in an Erdos-Renyi graph of size N to 
possess a degree k reads 



Since edges are connected to nodes regardless of their degree, Erdos-Renyi graphs 
represent uncorrelated graphs. Thus, the expectation value of the assortativity coef- 
ficient vanishes. The clustering coefficient Cer of Erdos-Renyi graphs can be easily 
derived, Cer = p, and vanishes for N — ^ oo at fixed k. The dependence of the aver- 
age shortest path length on p and N is much more complicated 161 110411 , but a typical 
distance I in Erdos-Renyi graphs is I ~ InN/ \nk fSl, i.e., it scales logarithmically 
with N. Thus, Erdos-Renyi graphs possess the small-world property. Finally we 
mention that almost any Erdos-Renyi graph is connected for k S> ln(N) Il74ll . 

Generalized random graphs. Empirical networks usually do not show a Binomial 
degree distribution, which inspired the investigation of network models allowing 
for non-Binomial degree distributions. Networks of such models possess randomly 
assigned edges, and the assignment of edges is solely constrained by a prede- 
fined degree distribution (or degree sequence). Prominent models may loosely 
be categorized into two classes with respect to whether they are based on stub- 
matching or link-switching. Stub-matching is employed in the renowned configura- 
tion model Ill05ill06ll for generating networks (cf. references in [|l].|5l; see II107I for 
a brief historical overview). A degree sequence {kj} is obtained from the prede- 
fined degree distribution and each node i is assigned a number kj of stubs. Stubs 
of pairs of nodes are connected at random until all stubs are connected. If multiple 
edges between nodes or self-connections occur, the network is discarded, and the 
process is restarted. Several approaches have been proposed to make this ansatz 

^ We did not observe qualitative differences between both models in the numerical experiments 
carried out for this thesis. 




(2.9) 
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computationally more efficient (see, e.g, references II1081I109I ). Methods based on 
link-switching (also known as Markov-Chain Monte Carlo methods, see II108H1111 
for an overview) are more frequently used in field studies and start with a net- 
work in which edges already exist. The simplest approach ll86llll2[[TT3l considers 
two randomly selected edges and {k,m). If edges {i,k) and (;,ot) do not exist, 
these edges are added and edges [i,]) and {k,m) are deleted, which is called link- 
switching. This step leaves the degrees of nodes unchanged and is repeated many 
timeqj. The resulting network is said to be randomized, and we refer to such graphs 
as degree-preserving randomized networks in the following. Variants of this approach 
have been proposed II1101I111| in order to ensure a uniform sampling of networks 
with predefined degree sequence. 

By construction, generalized random graphs do not show degree-degree correla- 
tions apart from those induced by structural constraints due to the finite size of the 
graphs. Thus, the expectation value of the assortativity coefficient approaches zerc§ 
or vanishes if a network without any degree-degree correlations is realizable given 
a defined degree sequence and the finite size of the graph. The clustering coefficient 
and an approximation of the average shortest path length then solely depend on the 
graph size and on the first two moments of the degree distribution [IJ. Moreover, 
generalized random graphs show the small-world property. 



Small-World model. The clustering coefficient of Erdos-Renyi networks and of 
generalized random graphs vanishes in the limit of large graph sizes (taken at 
fixed k). In contrast, many real networks possess large clustering coefficient de- 
spite of their large graph size. This has spurred the definition of models possessing 
adjustable clustering coefficients. The small-world model proposed by Watts and 
Strogatz II741I80II allows for both, a large value of the clustering coefficient and small 
values of the average shortest path length. In the original model, network construc- 
tion starts with a ring lattice of N nodes. Each node has 2m edges where m edges 
connect it to the mth nearest nodes clockwise, and the remaining m edges connect 
it to the mth nearest nodes counter-clockwise. A node is chosen, and with rewiring 
probability < p < 1, the edge connecting it to its first nearest neighbour in a 
clockwise sense is reconnected to a randomly chosen node (while avoiding self- 

^ In this thesis, the number of randomization steps was set to twice the number of edges present 

in the network, i.e., eN{N — 1). 
^ In our simulation studies, we usually observed deviations from zero in the order of 10^^ for 

N = 100. 
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Figure 2.2: Means of C(p) := C(p)/C(0) (open symbols) and L(p) := L(p)/L(0) 
(filled symbols) depending on the rewiring probability p (lines are for eye-guidance 
only). We used the Watts-Strogatz scheme (N = 1000, k = 4, 1000 realizations 
for each p) to generate networks from which clustering coefficients and average 
shortest path lengths are determined. Standard deviations for all quantities are 
smaller than symbol size. 

connections and multiple edges). This procedure is repeated for all nodes of the 
ring. Then, the second nearest neighbours are considered and reconnected with 
probability p as described before. By circulating around the ring, the rewiring pro- 
cess proceeds outward to more distant neighbours after each lap until each edge 
has been considered once |[74|. Note that even for p — > 1 networks are not equiva- 
lent to Erdos-Renyi graphs because they retain some memory from the construction 
process (each node has at least m neighbours) II114II . 

With the rewiring probability p, it is possible to interpolate between the case of 
a lattice (p = 0, large clustering coefficient) and that of a random graph (p = 1, 
small average shortest path length). For illustration purposes, we show in figure IZ2l 
the normalized clustering coefficient C(p) = C(p)/C(0) and the average shortest 
path length L(p) = L(p)/L(0) as a function of p for N = 1000 and k = 4. Net- 
works obtained for small non-zero values of p possess large values of the clustering 
coefficients but also display small values of the average shortest path length due 
to short-cuts introduced by the rewiring process. These networks are called small- 
world networks IHH^. In addition, it was shown that networks of the small- world 
model have the small-world property already for small non-zero values of p which 
depend on the size of the graph [114I - I116I 1. 
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Inspired by the small-world model, many studies evaluated properties of net- 
works derived from empirical data in order to classify them into distinct network 
classes (random, lattice, and small- world). The evaluation of the small- world prop- 
erty requires to investigate the existence of a scaling behaviour of the average short- 
est path length, an effort involving the assessment of the average shortest path 
length for varying numbers of nodes over multiple orders of magnitudes. This is 
typically not viable for empirical networks. Instead, clustering coefficient C and 
average shortest path length L of empirical networks are compared to those of an 
ensemble of random networks with the same number of nodes and edges. To this 
end, 7 = C/Q and A = L/Lr are determined where Q and Lr denote the mean 
values of the clustering coefficient and the average shortest path length, respec- 
tively, obtained from the ensemble of random network^ 7^1 and A ~ 1 are 
then considered as indicative of a small-world network, whereas 7 ~ 1 and A ~ 1 
or 7 ^ 1 and A ^ 1 are considered to indicate a random network or a lattice 
topology, respectively. This approach has been pursued in a vast number of studies 
across different disciplines. In this context, the notion "small-world network" signi- 
fies the presence of both, a large clustering coefficient and a small average shortest 
path length. 



2.1.3 Interrelationships between network characteristics 

Little is known about interrelationships between network characteristics. With the 
increasing popularity of network analyses, however, the question which network 
characteristics offer complementary or redundant information has become more 
important. For a few network models and network characteristics, analytical inter- 
relationships were found. For instance, C and L of generalized random graphs are 
functions of the first two moments of the degree distribution ||l|, and for the small- 
world model, C could be related to the mean degree and rewiring probability 111 141 . 
Besides exact relationships, bounds were reported which constrain network prop- 
erties with respect to other properties. For example, some spectral properties of 
networks are bounded by properties of the degree sequence II117I . Beyond that, pos- 
sible interrelationships were mainly investigated in numerical studies ||71 |118|[TT9| . 
Such studies determine correlation coefficients between different characteristics of 
network models or networks derived from empirical data. 

^ If degree-preserving randomized networks are used, the corresponding quantities will be denoted 
as Cdp, Ldp, and 7dp, Aqp, respectively. 
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Since many empirical networks show — unlike generalized random graphs — pro- 
nounced degree-degree correlations, a number of studies investigated possible re- 
lationships between the assortativity coefficient and other network characteristics. 
Empirical networks were found to display either assortative behaviour and large 
clustering coefficient (social networks) or dissortative behaviour and low clustering 
coefficient (non-social networks). It was argued, that assortativity might be a conse- 
quence of a pronounced community structure II120I or that networks "need" assor- 
tative degree-degree correlations in order to achieve large values of the clustering 
coefficient II121II . In numerical studies, the assortativity coefficient was found to be 
positively correlated with the clustering coefficient in networks with a scale-free de- 
gree distribution |1221I123I and more general but fixed degree sequences |124| . The 



same studies report the average shortest path length to be positively (negatively) 
correlated with positive (negative) values of the assortativity coefficient. These find- 
ings are confirmed by other studies numerically investigating relationships be- 
tween the clustering coefficient and degree-degree correlations ||113ill25lll26| . It 
was demonstrated that the clustering coefficient can be sensitively affected by de- 
gree-degree correlations and an alternative definition was proposed [127 | | . 

A major advance in unravelling a possible interrelationship between clustering 
coefficient and assortativity coefficient was achieved in a recent study II128I pub- 
lished at the time of this writing. The assortativity coefficient can be rewritten as a 
function of the clustering coefficient, of the number of paths of length 3, 2, 1, and 
of the number of stars of four nodefl In short, three quantities determine the ten- 
dency of a network to be assortative or dissortative. For the assortativity coefficient 
a holds 

oc P3/2 + C - P2/1, (2.10) 

where P3/2 (P2/1) is the number of paths of length 3 (2) divided by the number of 
paths of length 2 (1), and C is defined as the fraction of transitive triples (cf. sec- 
tion IZLT). P2/1 quantifies the relative branching of a network and obtains its largest 
value for star topologies and its lowest value for a linear chain. P3/2 is considered 
to reflect intercluster connectivity as argued in II128II . Thus, the interplay between 
the interconnectedness of clusters, the transitivity, and the relative branching deter- 
mine whether the network is dissortative {a < 0, strong tendency towards relative 
branching) or assortative {a > 0, strong transitivity and /or intermodular connec- 
tivity). Finally we mention that numerical studies reported assortative networks to 



A star of four nodes consists of a central node to which three nodes are connected. 
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show a stronger tendency to disintegrate into different connected components than 
dissortative networks II129L a finding supported by results from spectral graph the- 
ory II1301I131II . 

2.2 Inferring interaction networks 

As described before, an interaction network is a means to characterize the dynam- 
ics of a system. This representation requires the identification of nodes and edges 
which can be straightforwardly achieved for various systems. When inferring in- 
teraction networks for spatially extended systems (e.g., in the neurosciences, in 
geophysics, or in climate science), however, a reliable and meaningful identification 
of nodes and edges can pose a non-trivial challenge. Nodes are usually associated 
with sensors supposed to sample the dynamics of different subsystems. Edges are 
assumed to reflect interactions between these subsystems. These interactions can- 
not typically be inferred directly, e.g., by controlling the system and varying its 
parameters (coined active experiments in reference II132I '). Instead, interdependencies 
between the signals recorded by the sensors are assumed to indicate interactions 
between systems. Signals are usually available as multivariate time series, and inter- 
dependencies are estimated using time series analysis techniques (see section |2. 2. 1[) . 
From these estimates, edges can be derived in a number of ways discussed in sec- 
tion |ZZ2l 

2.2.1 Estimating signal interdependencies 

A large number of estimators of signal interdependence differing in concepts, sta- 
tistical efficiency (i.e., the amount of data required), and robustness (e.g., against 
noise contaminations) is available II132H1381 . Among those, methods from linear 
time series analysis are very frequently used in network field studies. Let Xi{t) and 
Xj{t) denote time series of length T {t = 1, . . . ,T) measured with sensors i and j. 
A prominent example is the correlation coefficient (also known as linear or Pearson 
correlation coefficient), corr(x;, xy), which estimates the linear dependence between 
the amplitudes of Xj and x,. Its absolute value is defined as 



T 




corr{xi,Xj)\ := T ^ Y^(xi(t) - Xi){xj(t) - Xj)a- V- 



(2.11) 



t=l 
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where and denote mean value and the estimated standard deviation of time 
series X/. Interdependencies occurring with a time lag between signals can be char- 
acterized with methods based on cross correlation functions II133I . The maximum 
value of the absolute cross correlation between two time series has also been used 
in field studies and is defined as 



pfj := max 



^{Xi,Xj){T) 



with 



^ (Xf, Xj) (' 



^(x„xO(0)^(x^-,x^)(0) 



E?ri^x,(^ + T)x^(0 ,T>o 



(2.12) 



(2.13) 



_^(xy,X/)(-T) ,T<0. 

Note that in most studies time series are normalized to zero mean before deter- 
mining the maximum absolute value of the cross correlation, in which case equa- 
tion (|2.12[) becomes the maximum absolute value of the cross covariance function. We 
will follow this approach and always determine the maximum absolute value of the 
cross covariance function, p^^ and pf^^ are both confined to the interval [0, 1] where 
values close to or equal to indicate no linear dependencies between Xj and Xj (for 
T sufficiently large), respectively, and values approaching 1 indicate the presence 
of strong linear interdependencies. 

Other methods take into account non-linear aspects of the dynamics when esti- 
mating interdependencies between signals. Among them, methods aiming at char- 
acterizing phase synchronization II139I have been frequently used in field studies 
of brain electric or magnetic activity. Time series x, are assumed to describe oscil- 
latory signals from which phase time series (pi can be determined using different 
techniques (e.g., by employing wavelets II140I , the Fourier- or the Hilbert trans- 
form II1411I142B '). Under certain conditions, these different approaches are equiva- 
lent M1431I144|| . Once phases are extracted, two signals are considered to be from 
phase synchronized systems if the difference between the corresponding phases is 
bounded, (pi{t) — (pj{t) < const (phase entratnment II145I ). In this view, the strength 
of signal interdependence is said to be stronger the more bounded the distribution 
of the phase differences. Phases represent directional data and their distributions 
can be characterized employing tools from directional statistics II146L A frequently 
used estimator is the mean phase coherence 11140111471 which is defined as the mean 
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resultant length M146I of the distribution of phase differences. 



T 

1 gi{<PM-<Pj{t}) 
^ t=l 



(2.14) 



Rij takes on values between (no phase synchronization) and 1 (perfect phase 
synchronization, strong signal interdependencies). 



2.2.2 Deriving edges 

Interaction networks can be derived in many different ways from the estimates of 
signal interdependence. Let pij denote some estimate of signal interdependence, 
i,j G {1, • • - /N}, and let us consider some function which maps the estimates pij 
to edges of a network described by the entries Aij of the adjacency matrix. A very 
frequently pursued approach to derive unweighted interaction networks is to define 
a threshold G R above which values of estimators are converted into edges, i.e., 

A,j = H{p,j-9), (2.15) 

where H{x) takes on the value 1 for x > and is zero else. This approach is often 
referred to as thresholding and is common in many scientific fields Il20ll44ll64 [114811 . A 
variant of this approach sometimes used if pij can take on negative values is 

Aj = H{\p,j\-e). (2.16) 

Instead of specifying the threshold directly, most studies require the resulting in- 
teraction network to possess a predefined mean degree k or, equivalently, a prede- 
fined edge density e in which case 9 is chosen accordingly. Predefining e is often 
considered advantageous since it was demonstrated that e can sensitively affect 
network characteristics II1491I150I . Another strategy for determining 9 is known as 
adaptive thresholding Il59l where the largest value of 9 is chosen for which the result- 
ing network is still connected. Other approaches to derive unweighted interaction 
networks rely on significance testing and have been proposed recently H38lll51i[T52| . 
Such methods set Aij = 1 only for those values of pij which are considered to be 
significant according to some test at a given significance level. Other methods are 
based on constructing a minimum spanning tree out of the matrix of estimates of 
signal interdependence II153I or on rank-ordered network growth II154I . 
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Weighted networks can be derived in a number of ways. The simplest one is to 
assume all edges to exist and to interpret the estimates of signal interdependence 
as weights of the edges, i.e.. 





W,, = ' ' (2.17) 



Variants of this approach are, e.g., to set Ai^ = if = or, alternatively, if the 
value of pij is considered to be not significant according to some test. Besides, ap- 
proaches were proposed to derive interaction networks having weight distributions 
with fixed first moment or with an additionally fixed second central moment [i63ll , 
i.e., 

W^j = prj-p + h or W^j=^^^ + h (2.18) 

where p and ap denote the mean value and the standard deviation of the values 
Pij' ^ 7^ // respectively The resulting weight distribution is centered around the 
value 1. More refined strategies were also suggested that map the values of signal 
interdependencies according to their rank order to a predefined distribution of edge 
weights IIT55I . 
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During the last years, the dynamics of a large number of complex systems have been 
analyzed using tools from network theory Interaction networks have been studied 
in different disciplines such as climate science Il37ll39ll64lll56[|157i , geophysics (seis- 
mology ll40U42ll44llT58ll). biology CHIll, quantitative finance lll48U153llT54llT59tiT6ll . 
and neuroscience IIT9ll20ll24ll25B . Studies published in these diverse disciplines ad- 
dress the same questions, namely whether different dynamical states are reflected 
in the topology of interaction networks and thus can be classified, predicted, or even 
controlled. To this end, promising features of interaction networks are considered 
those which cannot be expected to be present by chance. To identify such features, 
properties of interaction networks are usually compared with those obtained from 
random network models (Erdos-Renyi networks or generalized random graphs, cf. 
chapter |2]). 

Brain structural and functional networks (see, e.g., references IIT9ll20ll24| for an 
overview), climate networks Il35ll39ll64ll , and seismic networks Il42ll44l have been 
repeatedly reported to show small-world characteristics based on comparisons of 
their clustering coefficients and average shortest path lengths with those of random 
networks. While assortativity has frequently been investigated in social and tech- 
nical networks (the former were typically found to be assortative, the latter to be 
dissortative) for many years II120I , studies assessing the assortativity in interaction 
networks were published in recent times. Seismic networks were reported to be 
assortative [43J, whereas financial networks were found to be dissortative or assor- 
tative depending on the thresholding-strategy pursued for network inference II159II . 
Studies inferring networks using different neuroimaging techniques consistently 
reported brain functional networks to be assortative ||65l47n . Brain structural net- 
works were reported to be dissortative [66] or assortative ll56lll62l[T63B , an incon- 
sistent finding which might — among other influencing factors — ^be related to the 
employed differing neuroimaging and network inference techniques. 

A rapidly increasing number of studies in the neurosciences go beyond a mere 
classification of brain networks into small-world or assortative networks, but aim to 
relate properties of interaction networks to physiological or pathological processes. 
Network properties were found to reflect physiologic processes such as sleep II48U49I 
or aging iHSllSTI . Moreover, many studies reported changes of network properties 
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reflecting pathological states such as Alzheimer's disease [|52ll53l , schizophrenia 
Il54] - I56ll , or epilepsy |[57l - |63| . For example, topologies of interaction networks were 
reported to be closer to random networks for young and old subjects and more 
lattice-like for subjects of intermediate age IISTl . Interaction networks appeared to 
have larger values of the average shortest path length for epilepsy patients II63II 
than for healthy controls. The same was found for Alzheimer patients [521, where, 
in addition, lower values of the assortativity coefficient Il67ll compared to healthy 
controls were reported. More lattice-like topologies were found during sleep 114811491 
and during epileptic seizures H57l - |6ll . Moreover, recent findings indicate that the 
temporal evolution of some network characteristics may also reflect daily rhythms 

cn. 

In the following, we highlight typical ways how interaction networks are derived 
from empirical data. We demonstrate how analysis results are interpreted by con- 
sidering exemplary studies of functional brain networks of healthy subjects and 
epilepsy patients (section 13. From these observations, we draw the attention to 
fundamental challenges which are connected to the network analysis approach and 
which have not yet been thoroughly studied. Guided by our findings, we outline the 
following chapters and explain our strategies to narrow down the overwhelming 
number of methods and techniques used in applied network science (section [3. 2[) . 

3.1 Exemplary network analyses of brain electric and magnetic 
activity 

Typical observables assessed by electrophysiological techniques such as electroen- 
cephalography II165U166II or magnetoencephalography II167I are electric or magnetic 
field components (electroencephalogram (EEG) or magnetoencephalogram (MEG)), 
respectively, which are generated by neuronal activity. To pick up this activity, sen- 
sors are placed inside the skull (intracranial EEG), on the scalp (scalp EEG), or 
outside but in the vicinity of the brain (MEG). At each sensor, the electric or mag- 
netic activity is sampled at a prespecified sampling rate. The following studies 
investigate whether network characteristics reflect different physiological (see sec- 
tion 13.1.1)1 or pathophysiological (see sections 13.1.21 and I3.1.3[) states of the brain. 
For all studies, all patients and healthy subjects had signed informed content that 
the data might be used and published for research purposes; and the studies were 
approved by the local medical ethics committee. 
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3.1.1 Network characteristics reflect different physiological states 

We present exemplary results from a study II631I155II in which EEG and MEG data 
were obtained from subjects during controlled conditions, namely relaxed with eyes 
open or closed. We refrain from presenting all details (which can be found in 11631 
11551 ) but instead show selected findings. 



Data. EEG- and MEG-data of 23 healthy subjects (of age 33 ± 9 years, 11 women) 
were collected. Subjects were instructed acoustically to either open or close their 
eyes for periods of 15 minutes. The chronological order of the two periods was 
randomized across subjects, and surface EEG as well as MEG was recorded simul- 
taneously. MEG data were sampled at 254.31 Hz (16 bit A/D conversion; bandwidth 
0.1-50 Hz) using a 148-channel magnetometer system of which data of Nmeg = 130 
channels entered subsequent steps of analysis. EEG data were sampled at the same 
sampling frequency (bandwidth 0-50 Hz) from Neeg = 29 electrode sites accord- 
ing to the 10-10 system II168II of the American Electroencephalographic Society, and 
right mastoid was used as reference. 



Analysis. In order to allow for a time-resolved analysis, multivariate time series 
were divided into consecutive windows of 16.1 s duration (T = 4096 sampling 
points), which can be regarded as a compromise between the approximate sta- 
tionarity of the system and the statistical accuracy of the used estimator of signal 
interdependence 111 691 - 11 71| . In order to exclude movement artifacts at the beginning 
and at the end of the two conditions (eyes closed, eyes open), analysis was restricted 
to 40 windows for each condition. Signal interdependencies were estimated by the 
absolute value of the correlation coefficient (cf. equation (|2.11|) ) between all pairs 
of time series within each window. Unweighted interaction networks were derived 
via thresholding the values of signal interdependence such that each interaction 
network possessed a prespecified mean degree k (EEG data: ^eeg = 5, Seeg ~ 0.18; 
MEG data: ^meg = 15, Cmeg ~ 0.12). Clustering coefficient (C) and average shortest 
path length (L) were determined for each network. From C and L of each subject, 
average values (C) and (L) were calculated for each condition separately. Finally, 
group averages C and L were determined from all values of (C) and (L) for each 
condition. Significance of differences between the distributions of (C) ((L)) of the 
two conditions was assessed by using a Wilcoxon signed rank test for matched pairs 
(p < 0.05). 
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Figure 3.1: Mean values of the clustering coefficient C (left) and average shortest 
path length L (right) obtained from interaction networks derived from EEG or MEG 
data recorded under different physiological conditions. Significant differences in C 
and L between the different conditions are marked with stars (*). 



Results. In figure 13.11 we show C (left panel) and L (right panel) obtained for 
the different conditions (eyes closed, eyes open) and derived from EEG- as well as 
MEG-data. Significant differences between both conditions can be observed for L 
based on the EEG data. This indicates that physiological states are indeed reflected 
in this network property We note, however, that no significant differences could 
be observed for C based on the EEG-data and for C and L of interaction networks 
derived from MEG recordings. We observe both network characteristics to take on 
higher values for networks derived from MEG data — indicative of a more lattice- 
like topology — than for networks derived from EEG data. Interestingly, this can be 
observed despite Cmeg < ^eeg ^rid despite the tendency of networks with higher 
edge density to show larger values of the clustering coefficient. 

In references Il63lll55l , a plethora of different network construction methods (in- 
cluding different time series analysis as well as thresholding techniques) were em- 
ployed. It was a consistent finding that significant differences in network properties 
between different conditions were less frequently observed for networks derived 
from MEG data compared to networks derived from EEG data II155I . This might 
be attributed to various factors including the local currents (generating the electric 
and magnetic fields) and their location and orientation relative to the sensors II167I . 
However, it might also be related to the spatial sampling of the dynamics, to the 
number and spatial arrangement of sensors: magnetometer systems, as pointed out 
in II155II , allow for a higher spatial sampling than EEG sampling schemes, which is 
reflected in Nmeg ^ ^eeg- In addition, studies suggest that the strength of signal 
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interdependence estimated between time series recorded by the sensors may de- 
pend on the spatial distance between sensors II1721I173II . We will study the influence 
of the spatial sampling on network properties in the next chapter. 

3.1.2 Network clusters might be predictive of impending seizures 

Epilepsy is a brain disorder which is characterized by epileptic seizures, i.e., tran- 
sient occurrences of signs and /or symptoms due to abnormal excessive or syn- 
chronous neuronal activity in the brain |174| . 25% of the epilepsy patients can- 
not achieve sufficient seizure control (neither from medication nor from resective 
surgery). These patients would particularly benefit from methods which allow to 
predict epileptic seizures. Since early studies conducted in the 1970s, research on 
seizure prediction has gained momentum (see 11175111771 and references therein for 
an overview), but the problem of seizure prediction is still unsolved. While the con- 
cept of a well-defined localized area in the brain responsible for seizure generation 
was (and still is) widely accepted, there is now increasing evidence that the occur- 
rence of seizures may be better understood as a network phenomenon II138U178U179II . 
In reference Il98l , we studied whether clusters in interaction networks derived from 
EEG data are predictive of epileptic seizures. In this context, a cluster represents a 
set of brain regions (nodes) which might even be spatially distant. Here we refrain 
from recalling all details of the study but present exemplary results and discuss 
findings which point towards influences of the analysis methodology on the net- 
work structure. 

Data. Multi-day multi-channel EEG data (total recording time: 90 days, mean: 
154 h/patient, range: 45-267 h, average number of recording sites: 63, range: 32- 
76) were recorded intracranially from 14 patients (patients A-N) who underwent 
presurgical evaluation of pharmacoresistant focal epilepsies. Recordings captured 
a total number of 119 seizures (mean: 8.5, range: 6-14 seizures/patient), and the 
data were sampled at 200 Hz (16 bit A/D conversion; bandwidth 0.3-70 Hz) using 
a referential montage. Analysis was carried out retrospectively. 

Analysis. To allow for a time-resolved analysis, multivariate time series were di- 
vided into consecutive windows of 20.48 s duration (T = 4096 sampling points; see 
section 13.1.11 for the criteria used to choose the length of windows) and band-pass 
filtered in the well-known EEG frequency bands, namely 5 (0.5-4 Hz), 6 (4-8 Hz), a 
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(8-13 Hz), |6i (13-20 Hz), and (20-30 Hz) ElOl. For each frequency band and each 
window, we estimated signal interdependencies for all pairs of time series by using 
the mean phase coherence Ill40[|146ill47 l. Let R denote the matrix whose entries are 
the values of the mean phase coherences estimated for all pairs of time series within 
a window. We assume all edges to exist (adjacency matrix Aij = IVz 7^ An = 0) 
and derive the weight matrix W by setting W = R. This definition leads to a 
weighted undirected network. From each interaction network, we determine clus- 
ters by using a spectral clustering method which optimizes the modularity function 
(see sections [2 . 1 . 1 1 and 17. 1 1 as well as 1981 for details). In order to assess whether the 
occurrence or absence of clusters prior to seizures are predictive of seizures, we 
assumed that a pre-ictal state (i.e., a state prior to a seizure) exists and lasts for a 
certain amount of time Tp. We discarded data from recordings within 60min after 
the onset of each seizure in order to exclude effects from ictal (i.e., during seizures) 
as well as post-ictal (i.e., after seizures) periods. In addition, if data in an assumed 
pre-ictal period amounted to less than 70 % (e.g., due to recording gaps or due to 
seizure clustering), it was excluded from subsequent analyses. Tp was varied from 
15min to 240 min (in steps of 15min), and we determined the number Hp of pre- 
ictal and the number n, of inter-ictaU windows. For each cluster c identified in the 

(c) (c) 

Hp + rii windows, we determined its occurrence in all windows. Let rip and n^- 
denote the number of occurrences of cluster c in pre-ictal or inter-ictal time peri- 
ods, respectively. We define the true positive rate, TPR^^^ := nf' /up, and the false 
positive rate, FPR^'^^ := n"^^ /rii for each cluster c, for each assumed duration Tp of 
a pre-ictal state, and for each frequency band. We quantify the predictive power of 
each cluster by W^^) := |rPJ^(^) - fPR^^)] G [0,1], where W^^) = 1 (W^^) = 0) in- 
dicates a cluster to perfectly indicate (or not to indicate) a pre-ictal state. Since the 
same cluster structure is unlikely to show up in exactly the same pattern in different 
windows due to noise contributions, we define groups of clusters, which facilitate a 
robust identification of the most-frequently occurring clusters in a recording II181I . 
For exemplary recordings, cluster groups are algorithmically determined such that 
all members of each group of clusters do not differ in more than 6 nodes. 



Results. We consider receiver operating characteristic (ROC) spaces which are de- 
fined by FPR and TPR as x and y axis, respectively [|1821 . In figure |3^ we show two 
exemplary ROC spaces in which each point is associated with a cluster and a given 



All time periods except pre-ictal, ictal, and post-ictal periods. 
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Figure 3.2: Exemplary ROC spaces where each point in space is associated with a 
cluster and a duration Tp of a presumed pre-ictal period (color- and symbol-coded, 
see legend). Durations are given in minutes. Orange-shaded areas mark exemplary 
cascades of points in ROC space. Left: ROC space obtained for data in the 0-band of 
patient A. The gray line visualizes the distance (V) to the diagonal for an exemplary 
cluster. The predictive power of a cluster is higher the larger V. Right: ROC space 
obtained for data in the /32-band from patient E. 

duration Tp of the presumed pre-ictal period. The diagonal represents the set of 
points obtained for a random predictor. Thus, clusters are of interest whose points 
deviate from the diagonal, as reflected by the shortest distance V^^"^ between the 
respective point and the diagonal in ROC space, V^'^'' = W^'^V \/2- Points above the 
diagonal represent clusters whose frequency of occurrence is higher in the pre-ictal 
periods than in the inter-ictal periods, and the opposite holds for clusters whose 
points are below the diagonal. We observe points in ROC spaces (see figure I3.2|) 
which are associated with very similar FPR values but varying TPR values and 
which we call cascades in the following. Interestingly, points of a cascade belong to 
the same cluster but to different durations Tp. Moreover, we observe W to increase 
for decreasing Tp, which indicates that the frequency of some clusters increases (cf. 
left panel) or decreases (cf . right panel of figure I3.2[) prior to seizures. This network 
reorganization might point towards a gradual built up of some process prior to an 
impending seizure. 

In figure 13.31 we show exemplary time courses of occurrences of the clusters with 
largest W values for two patients. While the enlarged views (figure |33] (A) and (B) 
top) of recordings from both patients indicate relative changes in frequencies of 
clusters prior to seizures, we observe a large variability of the frequency of occur- 
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Figure 3.3: Exemplary time courses of occurrences (indicated as vertical green lines; 
15 minutes moving-average smoothing as blue line) of the most predictive cluster 
(cf. figure |3.2[) . Seizures are marked by vertical red lines and gray areas indicate 
recording gaps. Numbers on the x-axis indicate time of day. (A) Top: Enlarged 
view of a recording prior, during, and after a seizure (patient A, 0-band). Bottom: 
Complete recording. (B) Same as (A) but for patient E (/32-band). 



rence of clusters (shown as moving-average (15 minutes duration) of the discrete 
cluster occurrences) on a longer time scale (figure 13.31 (A) and (B) bottom). Thus, 
the question whether clusters in interaction networks are predictive of seizures, 
cannot be unequivocally answered. The variability of the frequency of cluster oc- 
currences might reflect influencing factors such as alterations of the antiepileptic 
medication, the specific nature of some epileptic process, physiological activities, 
or daily rhythms II1641I181II . 

Despite these remarkable findings, which deserve future investigations, there 
may exist influencing factors related to the acquisition of the data, which can affect 
interaction networks. We expect such influences to be present during the whole 
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Figure 3.4: (I) Top: exemplary schematic view of the electrode grid of patient A. 
Seizure onset zone was determined by the presurgical workup and is marked 
as magenta area (the lesion is marked as gray shaded area). (II) Top: exemplary 
schematic view of the electrode grid of patient F. Areas involved in language pro- 
cessing as determined by electrical stimulation are marked in green. Bottom: Four 
exemplary cluster groups which are among the 12 most frequently occurring clus- 
ter groups in the recordings of patient A (La, Lb, I.c, I.d) and of patient F (Il.a, Il.b, 
II.c, Il.d), respectively. Colors indicate participation frequency of brain sites within 
a cluster group (from black (0) to white (1)). 
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length of the recordings. Thus, to investigate this issue, we consider a temporal 
mean of the cluster content of all recordings for each patient, i.e., we investigate 
most frequently occurring clusters. As detailed above, we define groups of most 
frequently occurring clusters in order to minimize side effects due to noise contri- 
butions II181II . In figure l3!4l we show examples of groups of the most frequently oc- 
curring clusters for patient A (left column) and patient F (right column). We observe 
a group of clusters to cover a brain area (seizure onset zone) in which earliest signs 



of seizure activity can be identified (patient A, figure 13.41 Lb), which might reflect 
pathological activity, as well as cluster groups which cover brain structures sub- 
serving physiological activities (e.g., language processing, patient F, figure 13.41 Il.b 
and II.c). However, groups of clusters are clearly visible which reflect the anatomi- 
cal organization of the brain (patient A, figure 13.41 1.c and I.d). Their spatial extent 
corresponds to different brain lobes (temporal and frontal lobe) and parts of their 
boundaries follow the lateral sulcus. Moreover, for both patients A and F, we ob- 
serve groups of clusters to reflect reference electrodes (electrodes A7, A8 in patient 
A, figure l334l La; electrodes Al, A2 in patient F, figure l3!4l ll.a). Taken together, these 
findings suggest that factors concerning the acquisition of the data (e.g., spatial 
sampling relative to the anatomical organization, referencing) might — next to phys- 
iological and pathological activities — also leave an imprint in the properties (here: 
clusters) of derived interaction networks. 

3.1.3 Network characteristics undergo changes during seizures 

An improved understanding of the mechanisms underlying seizure initiation, 
spreading, and termination in human epilepsy can help to develop more efficient 
treatment strategies. To advance knowledge about the epileptic processes, seizure 
dynamics might be considered as a network phenomenon, a point of view 
corroborated by recent modeling studies Ill83i4189n . In reference II59II , we 
studied — in a time-resolved way — characteristics of interaction networks which 
were derived from EEG recordings capturing seizure dynamics. We briefly recall 
the analysis methodology and present exemplary results of this study. 

Data. Multi-channel EEG data (average number of recording sites: N = 53 ± 21) 
were recorded prior to, during, and after 100 focal onset epileptic seizures (mean 
duration: 110 ± 60 s) from 60 patients who underwent presurgical evaluation of 
pharmacoresistant focal epilepsies. The data were acquired (using strip, grid, or 
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Figure 3.5: 7dp (left), App (center), as well as e (right) averaged separately for pre- 
seizure, discretized seizure, and post-seizure time periods of 100 epileptic seizures. 
All error bars indicate standard error of the mean. Lines are for eye-guidance only. 

depth electrodes) from the cortex and from within other relevant brain structures 
(sampling rate: 200 Hz; 16 bit A/D conversion; bandwidth 0.5-70 Hz). Prior to anal- 
ysis, a bipolar re-referencing was applied which might diminish the influence of 
the recording reference mentioned in the previous sectiori^. 



Analysis. Multivariate time series were divided into non-overlapping consecutive 
windows of length 2.5 s (T = 500 sampling points; see section 13.1.11 for the criteria 
used to choose the length of windows). For each window, time series were normal- 
ized to zero mean and unit variance, and signal interdependencies were estimated 
by calculating_^the maximum value of the cross correlation function for each pair 
of time serieqj. We derived an unweighted interaction network for each window 
using adaptive thresholding: for each window, the largest threshold was chosen for 
which the resulting network was connected (while possessing a minimum number 
of edges). For each network, we determine its edge density e as well as normalized 
network characteristics 7dp := C/Cdp and Adp := L/Ldp, where Cpp and Lqp are 
obtained from degree-preserving randomization (cf. generalized random graphs in 
section I2.1.2[) of the network. Seizures were partitioned into 10 equidistant time 
bins, and averages of network characteristics, e, 7dp, and Aqp, were determined for 
each time bin. In addition, averages of network characteristics were also determined 
for networks derived from the pre-seizure and post-seizure time periods. 



^ Whether this is indeed the case, requires further investigations. 

^ We observed quahtatively similar results when using the maximum value of the absolute cross 
correlation fimction (p™) as estimator of signal interdependence. 
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Results. In figure 13.51 time resolved network characteristics 7dp (left panel), App 
(center panel), as well as e (right panel) obtained for all 100 seizures are presented. 
We observe 7dp and A^p to follow a concave-like movement. Both characteristics 
increase during the first part of the seizures and decrease already prior to the end of 
the seizures. This indicates a relative shift from more random towards more regular 
and back towards more random network topologies. Thus, the seizure state might 
be associated with more regular network topologies, which is in accordance with 
previous findings obtained from analyzing a smaller number of seizures Il58l . These 
findings come along with relative changes of the average edge density (right panel) 
which follows a convex-like movement, indicating a relative shift from denser to- 
wards sparser and back to denser networks. 

EEG recordings of epileptic seizures suggest that seizure dynamics are character- 
ized by rapid changes in time and frequency II190H193I during finite periods of time 
(usually 1-2 minutes). Choosing a length of the analysis windows (here: 500 sam- 
pling points, 2.5 seconds as a trade-off between temporal resolution and statistical 
reliability of estimators of signal interdependence) introduces an additional time 
scale which might influence results obtained from the subsequent network anal- 
yses. Furthermore, time series obtained from measurements are inevitably finite 
which limits the reliability of estimators of signal interdependence. The reliability 
of such estimators, which may also depend on the time scales present in the data, 
might also influence properties of derived interaction networks. Due to the adap- 
tive thresholding used for network inference, networks can possess varying edge 
densities. Results (cf. right panel of figure I3.5[) indicate that the edge density e un- 
dergoes systematic changes during seizures which might influence 7dp and A^p. 
Both are known to approach unity for e ^ 1. 

3.2 Discussion and outline 

The presented studies exemplarily demonstrate how interaction networks can be 
derived from spatially extended dynamical systems, and how network characteris- 
tics are analyzed and interpreted. Undoubtedly, the network approach towards the 
analysis and interpretation of multivariate data has contributed and still contributes 
to advance our understanding of complex systems and inspires the generation of 
new hypotheses. However, the fundamental issues of how to identify nodes and 
edges in spatially extended dynamical systems and how to assess significance of 
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findings are not yet fully understood. Moreover, it is conceivable that uncertainties 
with respect to these issues could affect properties of interaction networks derived 
from empirical data. 

Node identification is typically based on associating nodes with sensors capturing 
the dynamics. To this end, appropriate observables have to be chosen and sensors 
must be spatially placed. We already observed in section 13.1.1 1 that different record- 
ing modalities can lead to different findings obtained from network analyses. EEG 
and MEG recording techniques as used in section 13.1.11 do not only differ in their 
number of sensors and in the observables registered, but also in their spatial sam- 
pling scheme (including different spatial resolutions). Certainly, a spatial sampling 
scheme is usually chosen with regard to the spatial scales present in the system 
(thereby considering theorems for an appropriate sampling), but it also underlies 
technical constraints. This becomes also apparent when considering the placement 
of sensor grids schematically shown in figure 13.41 (cf. section I3.1.2[) , where it is 
straightforward to argue that the spatial sampling of the system will very likely in- 
fluence properties of interaction networks derived from the data. 

Edge identification is based on time series analysis methods which estimate inter- 
dependencies between signals. The reliability of such a method depends on various 
aspects such as the contamination of signals with noise contributions or the amount 
of available data. In addition, a successful inference of interdependencies will also 
depend on whether typical time scales present in the dynamics are technically ac- 
cessible and are accounted for by the chosen temporal sampling. Besides, time-re- 
solved network analyses approaches (cf . sections 13.1.21 and I3.1.3|) introduce addi- 
tional time scales (e.g., by splitting time series into sequential parts (windows) of 
prespecified length) from which networks are derived. This might also influence es- 
timators of signal interdependence. Finally, techniques are employed to infer edges 
from the estimates of signal interdependence. The exact influences of these tech- 
niques (edge- or mean degree-thresholding (section I3.1.1[) , adaptive thresholding 
(section I3.1.3|) , edge weight estimation (section I3.1.2|) , or significance testing 111511 ') 
on network properties are largely unknown. 

To assess significance of findings obtained from network analyses, values of net- 
work properties are compared to those from a null model. Some studies define a 
state of the system for which properties of interaction networks are determined 
and used for comparison (see, e.g., sections [3 . 1 . 1 1 or |3 . 1 .2 1) , while other studies make 
use of network null models (see, e.g., section 13.1.3)1 in which different concepts of 
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randomness are implemented to various extent. Among these null models, Erdos- 
Renyi graphs and degree-preserving randomized networks are most frequently 
used in field studies. Whether they are suited for interaction networks derived 
from the dynamics of a system which was spatially and temporally sampled, is not 
yet known. 

In this thesis, we investigate the influence of the spatial and temporal sampling 
on properties of interaction networks with modeling studies and simulation studies 
under controlled conditions. We study whether and to which extent findings carry 
over to field data studies by investigating interaction networks derived from the 
human brain with respect to the spatial and temporal sampling. In the light of 
these investigations, we discuss the appropriateness of commonly used null models 
and propose null models which can overcome identified limitations of previous 
null models. Given the vast number of different ways of how to derive interaction 
networks from empirical data, we need to focus our investigations on the most 
frequently used methods. To this end, we pursue the following strategies: 

• Wherever possible, we do not use specific estimators of signal interdepen- 
dence but instead take advantage of generic properties of such estimators in 
our studies (for instance, in large parts of chapter HJ. If studies require the def- 
inite use of estimators of signal interdependence, we will employ the absolute 
value of the correlation coefficient or the absolute value of the maximum 
cross correlation p^, both representing frequently used methods from the do- 
main of linear time series analysis techniques. We mention that it is still a mat- 
ter of debate whether to prefer methods from the domain of nonlinear time 
series analysis (for example, see references Il39lll94i ) or those from the linear 
domain (e.g., references Ill55ill95til97| ). The choice of an appropriate method 
will likely depend on the system and its investigated dynamical states II198I . 

• We translate estimates of signal interdependence into edges via threshold- 
ing. The threshold is chosen such that the network possesses a number of 
edges parametrized either by a prespecified mean degree or by an edge den- 
sity. We chose this approach because of its widespread use in the literature 
(for instance, see references Il20ll44ll64lll48ll ), for the sake of simplicity, and 
for its mathematical treatability. In addition, interaction networks obtained 
using this approach are unweighted and undirected, and thus can be charac- 
terized with well established and thoroughly studied methods. We note that 
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approaches allowing for the inference of weighted and directed interaction 
networks might help to gain deeper insights into the dynamics of complex 
systems. Although such approaches are promising, they are at an early stage 
of development at the time of writing this thesis and not yet widely used in 
network analyses of field data. 

• Among the plethora of techniques available for characterizing networks, we 
focus on methods yielding a scalar value from the analysis of a network. 
This way, we avoid potential complications arising from subsequent steps of 
analysis in which characteristics of different networks are often compared to 
each other. For instance, if networks possess different sizes, it is not yet well 
understood how to compare properties which cannot be represented by a sin- 
gle scalar value (e.g., clusterings, centralities) with each other. We choose the 
clustering coefficient and the average shortest path length as network charac- 
teristics because of their widespread use in the literature and because of their 
importance in the context of small-world networks. In addition, we choose 
the assortativity coefficient as network characteristic which is investigated in 
an increasing number of field studies in order to assess resilience and or- 
ganization of networks. Besides, this will enable us to gain insights into the 
usefulness of degree-preserving randomized networks for serving as network 
null model. 
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Characterizing the dynamics of a complex system in general requires a number of 
choices which have to be made prior to analysis. If the equations of motion of the 
system are not known (which is most often the case in studies of natural systems), 
investigations of the dynamics of a system usually rely on repeated experiments 
carried out under well defined conditions during which data from some appropri- 
ate observables are collected. When studying the dynamics of spatially extended 
dynamical systems, such as climate dynamics, dynamics of earth-quakes or of the 
human brain, the identification of appropriate observables which are accessible 
via measuring instruments can pose a highly non-trivial challenge. A number of 
sensors is placed so as to sufficiently capture the dynamics of the system. Sensor 
placement may be based on spatial sampling strategies (e.g., following the Nyquist 
theorem), or on a priori knowledge of the structural organization of the system 
(which is often not available), or on the intuition of the experimentalist. In most 
cases, the placement and the number of sensors is also subject to constraints im- 
posed by the measuring instruments and by finite resources. 

Interpreting the dynamics of a system in terms of an interaction network comes 
along with the assumption that the dynamics can be well represented by interac- 
tions (edges) between different subsystems (nodes). As nodes are associated with 
sensors, the number and spatial placement of sensors, which are often arranged in a 
lattice-like way, may affect the topology of the derived interaction network. In cases 
in which subsystems of the dynamics cannot be unequivocally identified, different 
sensors may pick up the activity of the same subsystem (i.e., a common source). 
In addition, since repeated experiments with well controlled changes of conditions 
are difficult to establish for various natural systems (e.g., the climate system), the 
inference of causal relationships between subsystems is usually replaced by the in- 
ference of correlations between time series. The accuracy of the inference of edges 
is typically restricted due to a finite amount of accessible data and is spoiled by 
unavoidable noise contributions, all of which may also influence the topology of 
derived interaction networks. In this chapter, we address the question whether and 
how these influences affect the inference of prominent network characteristics such 
as clustering coefficient, average shortest path length, and assortativity coefficient. 
These characteristics have been repeatedly used in field studies to classify inter- 
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action networks into network classes (lattices, small-world networks, random net- 
works, assortative or dissortative networks) and to draw conclusions about organi- 
zation principles of the dynamics of natural systems. Interaction networks derived 
from empirical data have frequently been reported to possess a small-world topol- 
ogy and to be assortative. Given these ubiquitous findings, we address the question 
whether interaction networks can sensibly and reliably be classified into the afore- 
mentioned categories given the currently available analysis methods and given the 
way how interaction networks are derived from empirical data. 

This chapter is organized as follows: in section 14.11 we begin with an example 
from field data analysis. Interaction networks are derived via thresholding the ab- 
solute values of the correlation coefficient and are compared to networks whose 
edges reflect spatial distances between sensors only. We study the impact of mea- 
surement uncertainties and a lattice-like arrangement of sensors (section 14.2. 1|) as 
well as the impact of common sources (section I4.2.2[) on network properties of de- 
rived interaction networks. We discuss the issue of node and edge identification 
in interaction networks as well as the use of traditionally employed network null 
models (Erdos-Renyi networks and degree-preserving randomized networks) in 
the light of the results reported in this chapter (section |4.3|) . Finally, we discuss 
approaches which can help to deal with the challenges of spatial sampling. 



4.1 Exemplary field data analysis 

We analyzed multivariate time series of brain magnetic activities recorded by a 
148-channel magnetometer system (magnetoencephalography (MEG), see reference 
II167II ') from a healthy subject with eyes closed [63J. The MEG data were sampled 
at 254.31Hz (within the frequency band 0.1-50 Hz) using a 16-bit analog-to-digital 
converter. We discarded time series recorded by the lowermost sensor ring due 
to potential contaminations with muscle activity, which restricts the number of 
available time series to N = 130 (see top panel in figure |4J] for a schematic showing 
the spatial arrangement of a subset of sensors). The length of time series was T = 
4096 sampling points, and signal interdependence between all pairs of time 
series were estimated using the absolute value of the linear correlation coefficient, 
|0?- (cf. section |2.2.1[) . Matrix W = p*^ is shown in the left panel of figure |4]T](B). As 
has been pursued in many field studies, we derive from W the adjacency matrix 
A- (right panel) of the interaction network via thresholding (we exemplarily choose 
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Figure 4.1: (A) Schematic of the spatial arrangement of a subset of sensors used 
to sample the dynamics of a human brain by MEG. (B) Left: Exemplary matrix W 
where entry yVij=yVji is the absolute value of the correlation coefficient (p'^j) between 

MEG time series Xi{t) and Xj{t) from sensor pair Right: Adjacency matrix A. 
derived from W by thresholding with k = 15. (C) Left: Matrix VV with entries 
Wij = F{dij), where dij denotes the Euclidean distance between sensors i and / in 
3-dimensional space, and F(dfy) = (1 + exp{u{dij — v)))~^ with u = 23 and v = 0.1. 
Right: Ji derived from VV by thresholding with k = 15. Note that Ji is not affected 
by the choice of F, as long as F decreases strictly monotonically with increasing djj. 
Entries of all matrices range from (black) to 1 (white). 
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a mean degree of = 15). W and A. display patterns of diagonals which can be 
attributed to spatially close pairs of sensors. 

From v4. we determine the clustering coefficient C = 0.58, the average short- 
est path length L = 3.13, and the assortativity coefficient a = 0.67. The value of 
a suggests that the interaction network is strongly assortative. In order to assess 
whether the interaction network possesses a small-world topology, we here follow 
an approach pursued in many field studies: 100 random networks are derived from 
A- by degree-preserving randomization of edges (cf. section I2.1.2|) . We denote the 
mean values of the clustering coefficients and of the average shortest path lengths 
of these networks by Cpp and Lqp, respectively. We determine 7dp = C/Cop and 
Adp = L/L]jp (cf. section |2.1.2|) and assume — like in many field studies — 7dp ^ 1 
and Adp ~ 1 to be indicative of a small-world topology. In the following, we 
use 7dp > 2 and Aqp < 2 as a practical criterion. With 7dp = 4.21 ± 0.15 and 
Adp = 1.53 ± 0.01, this interaction network would be interpreted as small-world 
network. 

We now come back to the observation that W and A- display patterns of diago- 
nals which represent edges between nodes whose associated sensors are spatially 
close (cf . figure 14.11 (B)). Let us exemplarily consider a basic model which defines a 
network without relying on any information about the dynamics of the system but 
which is solely based on the spatial distances between sensors. Let pij be an inter- 
dependence measure which depends on the Euclidean distance dij between sensors 
in three-dimensional space only. We assume the measure to strictly monotonically 
decrease with increasing distance djj. Thus, pij will take on higher values for spa- 
tially close sensors than for spatially more distant sensors. The network derived 
from p via thresholding displays a distance-dependent connectivity structure and 
can be considered as a spatial network. We note that spatial networks ©[ZllSSI have 
attracted much interest in network sciences during the last years. In the left panel of 
figure 14.11 (C) we show matrix VV = p obtained for choosing a sigmoid function for 
Pij. A, is derived from VV via thresholding as in the previous paragraph (fc = 15). 
Note that A. does not depend on the exact choice of the interdependence measure 
as long as the latter decreases strictly monotonically with increasing d/y. VV and A 
show diagonal patterns which are similar to the ones observed in W and A. Given 
the model defining this network, we expect network characteristics to indicate a 
lattice-like topology (reflected in large values of the clustering coefficient and the 
average shortest path length compared to random networks, cf. section |2.1.2|) . From 
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Jiwe obtain C = 0.57, L = 3.14, and a = 0.57. The assortativity coefficient indicates 
this network to be strongly assortative. Comparing values of C and L to mean val- 
ues obtained for random networks derived via degree-preserving randomization of 
yA, we observe 7dp = 4.97 ± 0.18 and Aqp = 1.55 ± 0.01. Thus, even this network, 
whose construction was based on spatial distances between sensors only, would 
have been classified as small-world network. Together with the apparent similarity 
of W and VV, this observation indicates that the spatial arrangement of sensors 
may substantially influence the topology of interaction networks. 

4.2 Simulation studies 

The previous examples already suggest that C, L, a and probably also other network 
characteristics reflect the spatial sampling of a dynamical system and the way how 
interaction networks are derived from empirical data (i.e., how nodes and edges are 
identified). In addition, it has to be taken into account that empirical data is typ- 
ically affected by the unavoidable imprecision of the acquisition system and may 
be spoiled due to inevitable noise contributions. Moreover, the amount of available 
empirical data is finite which further restricts the accuracy of time series analysis 
methods. This limited accuracy together with thresholding methods for deriving 
interaction networks — for which the mean degree or edge density are often chosen 
empirically — may lead to spuriously missing or additional edges in the network. 
These considerations lead us to our first question: How reliable do we have to es- 
timate edges in order to safely infer characteristics of interaction networks from 
empirical data? Another aspect is related to uncertainties in sensor placement. Sen- 
sors, which are identified with the nodes of the interaction network, are placed so 
as to sufficiently capture the dynamics of the system, and high values of estimated 
signal interdependencies are considered to be indicative of interaction between dif- 
ferent subsystems. However, due to a lack of knowledge of the actual structural 
organization of the dynamical system or due to technical constraints imposed by 
the acquisition system, some sensors may capture the dynamics of the same subsys- 
tem which will lead to strongly interdependent signals II166U199U200II . Most bivariate 
time series analysis techniques cannot distinguish between signal interdependence 
caused by interactions between different subsystems or by common sources. How 
will this affect network characteristics even in cases, where such a distinction was, 
in principle, possible? We will address these questions in the following. 
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4.2.1 Measurement uncertainties and latticelike arrangement of sensors 

Let us consider an interaction network which possesses a lattice-Hke topology. This 
topology might reflect the lattice-like arrangement of sensors or might truly reflect 
the actual interaction structure of some dynamics. It might even reflect a mixture of 
both. Lattice-like networks are assortative and display large values of the clustering 
coefficient and of the average shortest path length. We investigate, in the presence 
of measurement uncertainties, how reliable we have to estimate edges in order to 
safely classify the interaction network as a lattice (according to clustering coefficient 
and average shortest path length) and as an assortative network (according to the 
assortativity coefficient). To this end, we model lattice-like interaction networks as 
follows: we generate square-lattices and associate sensors with nodes. We assume 
an interdependence measure (as in the previous section, p) to strictly monotonically 
decrease with increasing distance between sensors. The number of nodes N and 
the mean degree k for deriving networks are chosen such as to meet typical values 
reported in many field studies. Note that not every desired pair of (N, k) values can 
be realized with this construction (consider a node at the boundary of a lattice and 
a node within the center of a lattice). We added a small amount of noise to each 
sensor position, which we consider realistic since sensors cannot be placed with 
infinite precision in experimental setups. As a result, the degree will vary slightly 
from node to node (while the network as a whole will still possess a predefined 
mean degree k). We carefully checked that the added noise does not qualitatively 
change results of our simulation studies and thus can be considered as part of 
the construction process of the lattices. We mention that the following qualitative 
results can also be observed for three-dimensional lattices. 

Clustering coefficient and average shortest path length. As in section |4H we use 
100 degree-preserving randomized networks in order to obtain mean values 7dp 
and Adp for each lattice. In the top panels of figure 14.21 7dp and Aqp are shown 
for different pairs of values {N,k). We observe 7dp ^ 1 and App ~ 1 for a range 
of (N, k) values, which would indicate these lattices to possess small-world char- 
acteristics. The upper right region of the {N,k) plane contains networks with high 
edge density e (cf. top right panel of figure 14.3)1 . Since Cpp, C, and thus 7dp ap- 
proach the value 1 for e ^ 1, these lattices would not be classified as small-world 
networks. The lower left region of the {N,k) plane comprises networks with low 
edge densities for which App ^ 1 would not indicate small-world topologies. For 
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Figure 4.2: Top: Mean values of normalized clustering coefficient 7dp (left) and 
normalized average shortest path length A^p (right) for square lattices with dif- 
ferent numbers of nodes N and mean degrees k (maximum standard deviations: 
^7df ~ 0-02 ^^'^ '^Aop = 0.02). White crosses mark {N,k) configurations for which 
lattices will be classified as small-world network if 7dp > 2 and A^p < 2 is cho- 
sen as a practical criterion. Bottom: Minimum fraction of randomly replaced edges 
K* for which the resulting network would be classified as small-world network 
(Adp < 2) in dependence on the edge density e. Error bars denote standard de- 
viations derived from 100 independent replacement runs, and lines are for eye 
guidance only. Note that error bars are smaller than symbol size in the majority 
of cases. 



these networks, a reliable inference of edges is of crucial importance for a correct 
classification, which we demonstrate in the following. 

A limited reliability of the estimation of edges will lead to spuriously additional 
and spuriously missing edges in interaction networks. In principle, the probabil- 
ity of erroneously detecting edges (false positives) can be controlled by multiple 
testing against some appropriately chosen null model H381I151I . However, such ap- 
proaches are well known to possess a limited power leading to a starkly increased 
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number of false negatives (missing edges). Moreover, for the large numbers of time 
series usually considered in field studies, the generation of appropriate null models 
for time series (i.e., surrogates 1120 II ) needed for multiple testing methods can be 
computationally expensive. Nevertheless, we can carry over concepts from multiple 
testing in order to assess the reliability needed to correctly classify networks in the 
lower left region of the {N,k) plane as lattices using 7dp and App. We model un- 
certainties from estimating edges by randomly replacing edges in the network. Let 
fly denote the number of randomly replaced edges. We define the fraction k G [0, 1] 
of randomly replaced edges, k := 2n^/ (kN), which represents the false-discovery 
rate 11202 1 in the context of multiple testing methods. Note that the replacement 
of edges affects 7 only marginally, and we always observed 7^1. Let n* be the 
average minimum number of randomly replaced edges ^ for which the network 
would be classified as small-world network due to a decrease of the average short- 
est path length such that 1 ~ Adp < 2 (see section I2.1.2|) . The minimum fraction 
K* := 2n* / (kN) of randomly replaced edges is defined accordingly, and its depen- 
dence on the edge density e is shown in the lower panel of figure 14.21 A fraction k* 
of less than 2 % is sufficient to falsely classify the lattices in the lower left region of 
the {N,k) plane as small-world networks due to a decrease of L (and thus Aqp). k* 
even decreases for increasing edge density. Furthermore, depending on the chosen 
mean degree, we observed that only one to five randomly replaced edges lead to 
7dp < 2 for networks with a small number of nodes. This sensitive dependence of 
the average shortest path length on the edge structure has also been reported in a 
number of theoretical studies (see, e.g., references II1151I1161I2031I204I '). It is crucial 
for inferring small-world characteristics from interaction networks derived from 
empirical data: changing or adding just a few edges can cause remarkable changes 
in the average shortest path length. 



Assortativity coefficient. Values of the assortativity coefficient a are shown in fig- 
ure 14.31 (top left) for lattices which were generated as described in the previous 
section. We observe large positive values of a for most of the lattices in the (N, k) 
plane {a > 0.5 for a range of edge densities e, cf. figure |43] top right). This can be 
explained by the definition of the assortativity coefficient which aims at character- 

^ n* is determined by 100 replacement runs. For each replacement run s = 1, . . . , 100, we start with 
a lattice, randomly replace an arbitrary edge, and determine App. The random replacement of 
edges is repeated until Aqp < 2 in which case we denote the total number of randomly replaced 
edges as n* Then, n* := (E, n* 0/100. 
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Figure 4.3: Top: mean values of the assortativity coefficient a (left) and values of 
the edge density e (right) for square lattices with different numbers of nodes N 
and mean degrees k (maximum standard deviation obtained from 10 realization of 
the lattices in the {N,k) plane: Ca = 0.06). Bottom: mean assortativity coefficient 
(obtained from 10 simulation runs) in dependence on the fraction k of randomly 
replaced edges for an exemplary lattice (N = 100, k = 10). The grey shaded area 
marks the standard deviation, and lines are for eye-guidance only. 



izing the average similarity (a > 0) and dissimilarity (a < 0) of node degrees at 
either end of edges. In our lattice networks, neighbouring nodes possess degrees 
which are very similaio This leads to high values of a. For networks with low 
edge densities (lower left region of the (N, k) plane), we observe lower values of a 
but still a > 0.14. For increasing edge density (e > 0.5, upper right region of the 
(N, k) plane), values of a fluctuate around 0. Note that the assortativity coefficient 
is not defined for e = 1 since the variance of the degree sequence (ki = {N — 
vanishes. 

We study the influence of a limited reliability of estimating edges on a by ran- 



In ideal lattices with periodic boundary conditions, all degrees are identical. 
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domly replacing edges in the networks. The dependence of a on the fraction k of 
randomly replaced edges § is shown in the bottom panel of figure |43l for an exem- 
plary configuration of N = 100 and k = 10. Findings obtained for other lattices of 
the {N,k) plane are qualitatively similar. We observe a to decrease for increasing k 
which can be ascribed to the random replacement of edges: it tends to destroy de- 
gree-degree correlations in the network and appears to approach the Erdos-Renyi 
network model [6 J in the limit k ^ 1. For a small fraction of randomly replaced 
edges (k < 0.1), our findings suggest that the assortativity coefficient is not as sen- 
sitively affected as the average shortest path length by uncertainties in estimating 
edgeo 

Briefly summarizing, the often used lattice-like arrangement of sensors together 
with a limited reliability when estimating edges can lead to indications of small- 
world topologies of interaction networks derived from the dynamics of spatially 
extended systems even if the actual interaction structure is not small-world. More- 
over, a lattice-like arrangement of sensors can lead to interaction networks which 
possess positive degree-degree correlations and thus would be classified as assor- 
tative networks. 



4.2.2 Common sources 

As already mentioned above, sensor placement may be based on spatial sampling 
strategies, or on a priori knowledge of the structural organization of the system, 
or on the intuition of the experimentalist. Since the number and precise location 
of subsystems are often unknown prior to analysis, the number N of sensors and 
their locations are typically chosen empirically and may, in addition, be subject to 
technical constraints. It is thus not surprising that some sensors may capture signals 
of the same subsystem. This issue becomes important considering spatial sampling 
strategies and interpreting the derived interaction network: in field studies, high 
values of estimators of signal interdependence between time series are often con- 
sidered as indicative of a relationship between different entities (e.g., a functional 

fl(K) is determined by 10 simulation runs. For each simulation run r = 1, . . . , 10, we start with 
a lattice, randomly replace an arbitrary edge, and determine k = lur/ (kN). The random 

replacement step is repeated until k > 0.3. Finally we obtain a{K) = (X^^ fl(^)(K))/10. 
^ This finding, however, will substantially change if a limited reliability of estimating edges trans- 
lates into a random replacement of edges which favours edges between nodes of similar (increas- 
ing a) or different (decreasing a) degrees. We consider such systematic uncertainties unlikely in 
typical field studies. 
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interaction between subsystems). However, if two time series reflect the dynamics 
of the same subsystem (i.e., a common source), frequently used estimators of signal 
interdependence, such as the correlation coefficient or the mean phase coherence, 
will also indicate strong interdependencies between these time series, which would 
be erroneously considered as indicative of two interacting different entities. Uncer- 
tainties when placing sensors together with commonly used time series analysis 
techniques will likely lead to additional nodes and edges in a derived interaction 
network. 

We study the impact of common sources on the clustering coefficient, the aver- 
age shortest path length, and on the assortativity coefficient of derived interaction 
networks with two models. We assume a dynamical system to be well represented 
by a network J\f consisting of N nodes and some edges. Nodes represent sub- 
systems and edges reflect interactions between them. We model the influence of 
common sources by introducing for each sensor i an additional sensor i' with zero 
spatial distance between them. The resulting network J\f* then consists of N* = 2N 
nodes. In our first model, we assume that edges are derived by using a time series 
analysis technique which cannot distinguish between interdependencies reflecting 
functional interactions and "false interdependencies" due to sampling the same 
subsystem. We note that this holds for most bivariate time series analysis meth- 
ods. The network according to the first model is denoted as J\f^. With our second 
model, we consider a time series analysis method which we assume to be able to 
distinguish between both cases. The resulting network is denoted as M^^- 

First model. Due to the placement of the duplicate sensor, the corresponding node 
i' of the interaction network is connected to the neighbours of node i (cf. inset of 
figure l4!4l left). In addition, i' is connected to i since both associated sensors sample 
the same subsystem, and the considered time series analysis methods indicate per- 
fect signal interdependence. We derive the clustering coefficient C* and the average 
shortest path length L* of the network J\fi as functions of C, and L of A/" as 




(4.1) 



L* = L + Li with Li 



N 



(4.2) 



2\S\' 
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Figure 4.4: Results obtained for the first model. Left: Local clustering coefficient C* 
of node i of Af^ as a function of C; of Af for different node degrees A:,. Construction 
of Af^ is shown schematically in the inset. Nodes and edges included in J\f and Mi 
are colored black, while nodes and edges only included in Afi are colored gray. 
Right: Means of C(p) := C(p)/C(0) (open symbols) and L(p) := L(p)/L(0) (filled 
symbols) for J\f depending on the rewiring probability p (lines are for eye-guidance 
only). C*(p) and L*(p) denote the corresponding quantities for M^. We used the 
Watts-Strogatz scheme (N = 1000, = 4, 1000 realizations for each p) to generate 
J\f networks (symbol A) and derived Af-^ networks (symbol v) by duplicating all 
nodes from Af. Standard deviations for all quantities are smaller than symbol size. 



where kj and | S \ are quantities of Af and denote the degree of node i and the num- 
ber of pairs of nodes connected by some path, respectively (see section I2.1.1[) . The 
derivation of these equations is provided in section [721 Note that Li G [277'!]' 
where the lower bound holds for connected networks (a path exists between ev- 
ery pair of nodes) and the upper bound for networks without edges. Obviously, 
the impact of introducing additional nodes (i.e., sensors) on the average shortest 
path length can be neglected since L* ~ L. In contrast, the clustering coefficient 
is increased, C* > C, because for the local clustering coefficients C* > C; holds. 
Their increase depends on the degrees of nodes as well as on C, (cf. figure 14.41 
left). In order to demonstrate this effect, we generate network topologies of A/ us- 
ing the Watts-Strogatz small-world model [741 iri which edges are rewired with 
probability p: starting from a ring-lattice (p = 0), different topologies are obtained 
by successively increasing p until random networks § are reached for p = 1 (cf. 
section |2.1.2|I . In the right panel of figure l4!4l we observe for all rewiring probabili- 



^ We follow the wording in reference [74] and call networks obtained for p = 1 random networks. 
Note, however, that these networks are locally not equivalent to random networks since they 
retain some information about the rewiring procedure II114L 
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ties L*(p)/L*(0) ^ L(p)/L(0). In contrast, C*(p)/C*(0) clearly exceeds C(p)/C(0) 
when increasing p such that even networks derived from random networks J\f 
(p = 1) would be characterized as small-world networks. 

We now derive (cf. section [72] for details) the assortativity coefficient a* of A/'^*, 

a* = a\a + a2, (4-3) 

where 

^1 • "n'^-kr + 1)3 - (E(2fc. + 1)2)2/ ^(2fc^. + 1) ^^-^^ 

and 



_ (8(Efc?)(i + Efcf / Efc.) + 2 Efc. + E(2fc. + 1)2 - 

E(2fc. + 1)3 - (E(2fc. + 1)2)2/ ^(2fc^. + 1) ^ ^ (4-5) 

are functions of the degrees of nodes in M, and a denotes the assortativity co- 
efficient of M . We demonstrate this dependence by generating networks M with 
different degrees of assortativity or dissortativity, i.e., different values of a. To this 
end, we start with an Erdos-Renyi network from which we derive networks us- 
ing a degree-preserving but degree-degree (anti-) correlations inducing rewiring 
scheme [|122tll23i . The degree of assortativity or dissortativity is governed by some 
probability p with which a rewiring step must favour a rewiring which increases 
or decreases a, respectively. In the limit p = 0, this rewiring scheme becomes iden- 
tical to the one widely discussed and used in the literature B86llll0lTll3l for gener- 



ating degree-preserving random networks without degree-degree correlations. In 
figure |4.5[ the dependence of the assortativity coefficient a* on the assortativity 
coefficient a is shown for different values of the mean degree k oi J\f (left panel: 
k = 2, right panel: k = 4). Since the rewiring process leaves the degrees of nodes 
unchanged, ai and ^2 are constants. We observe the assortativity coefficient of M-^ 
to be increased compared to the one of Af, and the relative increase becomes larger 
the smaller the mean degree k (for networks possessing edges). Remarkably, for a 
regime of values of a indicating a network J\f to be dissortative {a < 0), we even 
find fl* > indicating to be assortative. 

Briefly summarizing the findings obtained for the first model, common sources 
together with frequently employed time series analysis techniques used to infer 
edges likely lead to indications of small-world and assortative network topologies 
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Figure 4.5: Results obtained for the first model. Dependence of the assortativity 
coefficient a* of A/\* (symbol v) on a of A/" for networks with N = 1000 nodes and 
a fixed degree sequence. The degree sequence was obtained from an Erdos-Renyi 
network (N = 1000) with mean degree k = 2 (left) and k = 4 (right). Networks A/" 
were generated from the Erdos-Renyi network by employing a rewiring scheme in- 
creasing or decreasing degree-degree correlations. Lines are for eye-guidance only. 

even in cases where the underlying interaction structure is neither small-world nor 
assortative. 

Second model. We consider a time series analysis technique which we assume to 
be able to distinguish between interdependencies reflecting functional interactions 
between different subsystems and interdependencies due to a common source. As 
in the first model, we introduce for each sensor i an additional sensor i' with zero 
spatial distance between them. In the corresponding interaction network, node i' 
is connected to all neighbours of node i. In contrast to the first model, i' is not 
connected to i since the considered time series analysis methods do not indicate a 
functional interaction between i and i'. We derive (see section 17.21 for details) the 
clustering coefficient C* and the average shortest path length L* of Af-^ as functions 
of Ci and L of A/" as 

C; = (4.6) 
L* = LiL + L2, (4.7) 

where 
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Figure 4.6: Same as figure l4!4l but for the second model. Left: Local clustering coeffi- 
cient C* of node i of A/'2* as a function of C; of Af for different node degrees fc/. Con- 
struction of J\f2 is shown schematically in the inset. Nodes and edges included in J\f 
and A/J are colored black, while nodes and edges only included in A/J are colored 
gray. Right: Means of C(p) := C(p)/C(0) (open symbols) and L(p) := L(p)/L(0) 
(filled symbols) for J\f depending on the rewiring probability p (lines are for eye- 
guidance only). C*(p) and L*(p) denote the corresponding quantities for Af2- We 
used the Watts-Strogatz scheme (N = 1000, = 4, 1000 realizations for each p) 
to generate M networks (symbol A) and derived A/J networks (symbol v) by du- 
plicating all nodes from J\f. Standard deviations for all quantities are smaller than 
symbol size. 

No denotes the number of nodes without neighbours in A/", Nq = \{i \ kj = 0,i = 
1, . . . , N} I . Note that Li G [1, 2], where the upper bound holds for networks without 
edges (No = N) and the lower bound for networks in which each node possesses 
at least one edge (Nq = 0) which, e.g., is the case for connected networks. Further- 
more, L2 G [0, 5], where the lower bound holds for networks without edges and is 
approached by connected networks (L2 = N~^). The upper bound is approached 
by the special case of networks with decreasing Nq and increasing number of con- 
nected components and reached for N/2 connected components and No = 0. Taken 
together, the impact of introducing additional sensors (i.e., nodes) on the average 
shortest path length can be neglected in networks possessing edges, and L* ~ L. 
Since C* < C,, the clustering coefficient C* is smaller than or equal to C depending 
on the degrees of nodes in J\f. Note that the maximum possible reduction amounts 
to C* = §C/ {ki = 2) only (cf. left panel of figure US and that C* = Q for ki G {0, 1} 
and that C* — ^ C, for increasing k^. These three factors will likely lead to only a 
slight decrease in C* in real world networks. 
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To demonstrate the relationships derived above, we generate networks A/" according 
to the Watts-Strogatz small-world model as before. C*(p)/C*(0) and L*(p)/L*(0) 
of A/J as well as C(p) / C(0) and L(p) /L(0) of A/" are shown for different values of 
the rewiring probability p in figure l46l (right panel). We observe C*(p)/C*(0) < 
C(p)/C(0) and L*(p)/L*(0) ^ L(p)/L(0) for all rewiring probabilities. Thus, net- 
works A/J derived from random networks Af {p = 1) would not be falsely classified 
as small-world but as random network. 

We continue and derive the assortativity coefficient a* of A/'2* as a function of a of 
M (details can be found in section |7j2]). Remarkably, 

a* = a. (4.9) 

Summarizing the findings obtained for our second model, common sources do 
not affect the assortativity coefficient if edges are inferred using time series analysis 
techniques which are capable of distinguishing between interdependencies due to 
a common source and interdependencies reflecting functional interactions between 
subsystems. Moreover, for such time series analysis methods, common sources do 
not artificially increase the clustering coefficient. As a result, random networks are 
not misclassified as small-world networks in the presence of common sources in 
our model. 

Taken together, our findings indicate that interaction networks are likely to be 
classified as small-world networks even if the underlying interaction structure is 
lattice-like (due to measurement uncertainties) or random (due to the presence of 
common sources and the use of common time series analysis techniques). Moreover, 
interaction networks are likely to display assortative mixing of node-degrees even 
in cases in which the underlying interaction structure corresponds to a dissortative 
or uncorrelated network (due to the presence of common sources and the use of 
common time series analysis techniques). 

4.3 Discussion 

As demonstrated, properties of interaction networks derived from spatially ex- 
tended systems can non-trivially be influenced by the spatial sampling of the dy- 
namics. In the following, we discuss this influence in the context of the identification 
of nodes, the identification of edges, and the choice of null models. Finally, we sug- 
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gest research directions which can guide the development of methods taking into 
account the issue of spatial sampling. 

The identification of nodes is based on the assumption that the studied system 
can be meaningfully decomposed into different parts. While this decomposition 
can be straightforwardly achieved in many cases, e.g., when studying social net- 
works, transportation networks, or the internet, it represents a challenging task 
for the investigation of many spatially extended natural systems where either the 
exact structural organization of the systems is not known or the dynamics are spa- 
tial diffusion or field processes. The identification of nodes is often approached 
by associating nodes with sensors supposed to capture the dynamics of different 
subsystems, thereby translating the issue of node identification into the notoriously 
non-trivial challenge of spatially sampling the dynamics. This includes the choice 
of the number of sensors, the choice of a spatial sampling strategy (spatial arrange- 
ment of sensors) as well as choosing various characteristics of the sensors (e.g., 
sensitivity). The spatial sampling implicitly leads to a coarse graining of the dy- 
namics and determines a spatial scale at which the dynamics is studied. Together 
with considering a spatially extended system as a network of interacting subsys- 
tems, the spatial sampling imposes a spatial structure on the system, irrespective of 
its actual organization, which may also underlie spatial restrictions. 

We analyzed an exemplary recording of brain magnetic activity (cf . section 14. 1[) 
and compared the derived interaction network with a network generated from a 
spatial model which depended on the position of sensors in three-dimensional 
Euclidean space only. The remarkable similarity of the clustering coefficient, the 
average shortest path length, and the assortativity coefficient of both networks al- 
ready suggested a strong influence of the spatial sampling on network properties. 
Both networks would be classified as assortative small-world networks when com- 
paring their properties with those of degree-preserving randomized networks. In 
simulation studies (cf . section I4.2[) , we demonstrated that the spatial sampling can 
introduce spatial correlations in the topology of derived interaction networks. We 
studied experimental setups in which sensors capture the dynamics of the same 
subsystem (a common source) leading to similarities in the recorded time series. In 
order to infer edges, we considered typical time series analysis techniques which 
cannot distinguish between signal interdependencies due to common sources and 
interdependencies reflecting interacting different subsystems (see first model in sec- 
tion |4]Z2]). Nodes associated with sensors capturing the same dynamical subsystem 
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lead to an increase of the clustering coefficient, because these nodes are highly in- 
terconnected to each other due to the common source. It has been suggested to 
manually correct the clustering coefficient for this influence 11205 II , but such an ap- 
proach relies on a priori knowledge about the exact spatial organization of the 
system which may not be generally available. In our model, nodes capturing the 
dynamics of the same subsystem possess the same degree and, in addition, are 
connected to each other. Thus, common sources induce extra edges between nodes 
of similar or equal degree which increase the assortativity coefficient of the net- 
work. In our simulation studies, we observed that this can lead to a classification 
of an interaction network as assortative network even in cases where the actual in- 
teraction structure is dissortative. We found the average shortest path length also 
to be influenced by common sources but to a much smaller extent than the clus- 
tering coefficient and the assortativity coefficient. This may be partly attributed to 
the fact that nodes reflecting the same subsystem possess the same neighbourhood, 
are connected to each other, and thus share a common pattern of shortest paths. 
However, the value of the average shortest path length was sensitively influenced 
by uncertainties when estimating edges, as discussed in the following. 

The identification of edges poses a challenge which is partly interrelated with the 
issue of identifying nodes. Active probing for interactions between subsystems is 
often not possible in natural dynamical systems. Instead, interactions are inferred 
from observations by interpreting signal interdependencies estimated using time 
series analysis techniques. The inference of edges is then influenced by several 
factors which we discuss in the following and which may be associated with four 
aspects, namely the issue of common sources, the issue of indirect interactions, the 
issue of a limited reliability of edge estimation in the presence of noise and a limited 
amount of empirical data, and the question of how to decide whether to translate 
an estimated value of signal interdependence into an edge or not. 

First, as discussed above, common sources lead to additional edges in derived 
interaction networks since most time series analysis techniques cannot distinguish 
between interdependencies due to common sources or interdependencies reflect- 
ing interactions between different subsystems. Methods capable of unequivocally 
distinguishing between both types of interdependencies could remedy the prob- 
lem of an artificial increase of the clustering coefficient and the assortativity coef- 
ficient as suggested by our simulation studies (see second model in section I4.2.2[) . 
To our knowledge, only few time series analysis approaches have been proposed 
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B2001I206I - I208II which address the problem of sampling common sources employ- 
ing different strategies. Methods proposed in references II2001I2071I208II are based 
on the assumption that a common source leads to instantaneous interdependencies 
(with zero time lag) between time series. If these instantaneous interdependencies 
could be separated from those associated with a non-zero time lag, this would lead 
to techniques capturing interdependencies reflecting interactions between different 
subsystems only. Another strategy is based on a priori knowledge of the system 
and relies on the modeling of common sources II206L All these methods have not 
yet been thoroughly investigated in the context of deriving interaction networks 
and, in addition, have not yet found wide application in field data studies. They 
do not account for the second issue, namely the challenge of how to distinguish 
between direct and indirect interactions. Although we did not explicitly study this 
influencing factor, its effect on the topology of derived interaction networks can be 
straightforwardly deduced: signal interdependencies between two different non-in- 
teracting subsystems can arise due to a third subsystem which interacts with the 
other two (see, e.g., references 11209112121 '). This will likely lead to the inference of 
edges between neighbours of a node and thus to an artificial increase of the cluster- 
ing coefficient of the derived interaction network. Third, a limited reliability of the 
estimation of edges in the presence of unavoidable noise contributions and a limited 
amount of available data likely leads to the spurious addition of, change in, or the 
deletion of edges. We observed in our simulation studies (cf. section |4.2.1|) the aver- 
age shortest path length to depend sensitively on the actual edge structure which 
is in agreement with a number of theoretical studies (see, e.g., II1151I1161I2031I204II '). 
Uncertainties of edge estimation will likely introduce spurious short-cuts in the net- 
work decreasing the average shortest path length. While the average shortest path 
length can significantly change when changing just a few edges, the clustering co- 
efficient and the assortativity coefficient appeared to be more robust with respect to 
uncertainties in edge estimation. Taken together, the artificial increase of the clus- 
tering coefficient and the assortativity coefficient due to common sources and the 
artificial decrease of the average shortest path length due to a limited reliability 
when estimating edges will likely lead to interaction networks which are classified 
as small-world networks with assortative edge structure. Our results show that this 
can also be expected for derived interaction networks where the underlying inter- 
action structure of the system has a lattice topology (cf. section 14.2. In addition, 
if sensors are arranged in a lattice-like fashion and spatially neighboured sensors 
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pick up activity from common sources, a lattice topology will naturally arise from 
the measurement and the way how interaction networks are typically derived from 
empirical data. The topology of such a network will likely be classified as small- 
world given the sensitive dependence of the average shortest path length on noise 
contributions. This sensitive dependence on the actual edge structure calls for the 
development of improved time series analysis techniques and for the control of 
the amount of spurious edges in the inferred network. This is related to the fourth 
aspect, namely the question how to decide whether to translate an estimate of sig- 
nal interdependence into an edge or not. In principle, this decision can be based 
on significance testing against some appropriate null model. Multiple testing tech- 
niques have been developed to control the probability of false positives (spurious 
edges) in networks derived from empirical data II151L While methods controlling 
the familywise error (i.e., the probability of detecting spurious edges among all 
possible pairs of nodes) have been developed over the years but are known to come 
along with a high risk of false negatives (spuriously missing edges) II213II , methods 
controlling the false-discovery rate (i.e., the probability of false positives among 
all inferred edges) appear to be promising approaches with a lower risk of false 
negatives Ill51il202ll214ll . However, limiting the probability of erroneously adding, 
changing, or deleting just a few edges — needed for a reliable estimate of the average 
shortest path length — calls for small probabilities of both, detecting false positives 
as well as missing false negatives, which represents a challenging task for currently 
available multiple testing methods. 

Network null models can be used to assess the significance of properties found 
in interaction networks derived from empirical data. Null models usually imple- 
ment some default position which is expected to be matched in the trivial case 
and which needs to be rejected in order to establish significance of findings. The 
spatial sampling of the dynamics of a spatially extended system leaves an imprint 
in the topology of derived interaction networks, but the most frequently employed 
network null models in field data studies, Erdos-Renyi networks and degree-pre- 
serving randomized networks, do not account for this imprint. As a result, many 
findings of small-world topologies in interaction networks of spatially extended 
dynamical systems might be attributed to the use of null models not taking into ac- 
count an artificially increased clustering coefficient due to the spatial sampling. We 
even observed that a comparison of properties with those of random networks can 
falsely indicate an actual lattice to possess a small-world topology (cf. section |4. 2. 1|) 
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according to a widely employed classification scheme. This is because a comparison 
with some null model can only provide clues as to how much the topology differs 
from the one of the null model (in this case a random network). A comparison with 
lattices has been proposed 112151 but has not yet been frequently employed in field 
studies. Indeed, using lattices as null models will likely indicate derived interaction 
networks to possess small-world topologies since such a null model does not take 
into account uncertainties of estimating edges which can significantly decrease the 
average shortest path length. In addition, one has to decide upon the dimensional- 
ity and construction of lattices, which both can decisively affect the result of such 
a comparison. Another result of using Erdos-Renyi networks or degree-preserving 
randomized networks as null models is the finding of interaction networks which 
are assortative. Both null models describe random network ensembles which are, 
by definition, neither assortative nor dissortative. Our results indicate, however, 
that the spatial sampling likely leads to the inference of interaction networks which 
are assortative, irrespective of the underlying interaction network structure. Taken 
together, our finding call for the development of refined null models taking into 
account the effects of spatial sampling on the network topology. 

In this chapter, we restricted our investigations to the clustering coefficient, the 
average shortest path length, and the assortativity coefficient. We believe that other 
network characteristics (for instance centrality measures or community structures) 
can also be strongly influenced by the spatial sampling. A steady growing number 
of studies employing such measures call for an investigation of potential influences 
of the spatial sampling. Different research directions appear promising to approach 
the issue of spatial sampling. These directions may be attributed to two main strate- 
gies. The first strategy aims at an improved identification of the actual structural 
organization of the dynamical system and can help to advise the design of appropri- 
ate sensor placement schemes. While this approach is currently being pursued, for 
instance, in the neurosciences II162II , it appears to be appropriate for those systems 
in which subsystems can be unequivocally identified. If the latter cannot be mean- 
ingfully achieved (which might be the case for spatial diffusion or field processes), 
a representation of the dynamics of such systems by an interaction network will al- 
ways constitute a coarse graining of the dynamics. The value of such a description 
may vary and will depend on the application and aim of the study. Influences of 
the coarse graining scheme and the spatial scale on analysis results have been stud- 
ied under different notions in various contexts among which we mention spatial 
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analysis of areal data (see references II216[|217| and references therein), climate sci- 
ence (e.g. reference II218II ), or in interaction networks derived from fMRI data II219II . 
The second strategy aims at improving existing and developing novel time series 
and network analysis techniques Il29ll200ll208ll215ll220ll221| as well as null models 
which take into account the spatial sampling of the system. Such developments 
may benefit from computational network analyses (see, e.g., 112221 - 12241 ). Among 
the many possible directions we mention spatial null models Il28ll225l , data-driven 
node-merging strategies (which represent coarse graining schemes on the network 
level) 1 291 12211 , the development of network characteristics that are invariant under 
influences of spatial sampling 11226 L and the development of time series analysis 
techniques which aim at distinguishing between direct and indirect interactions (cf. 
chapter 8.3 in |[T33l and references ll2T0ti2T2ll227H2^ ). These strategies can help 
to disentangle network characteristics reflecting true functional interactions from 
those spuriously arising from the spatial sampling of the dynamics. 
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As was demonstrated in the last chapter, the spatial sampling of a system can in- 
troduce non-trivial structure in the topology of interaction networks. This structure 
typically does not reflect properties of the studied dynamics but properties induced 
by the sampling scheme superimposed on the actual (and often unknown) spatial 
organization of the system. Effects induced by spatial sampling will probably be of 
less importance if properties of interaction networks are to be compared across dif- 
ferent measurements during which the spatial sampling scheme does not change. 
A common scenario would be a sliding window analysis of long-lasting multivari- 
ate time series, where relative changes of network properties across windows are 
of interest only (see, e.g., references H59ll63l [T64ll230ll231in . 

Let us assume that we could spatially sample a dynamical system under study 
in a perfect way. In addition, let noise contributions be negligible. Will interaction 
networks solely reflect mutual relationships between interacting dynamical subsys- 
tems in such a situation? We will now focus on two aspects connected to the tem- 
poral sampling of the dynamics. First, time series considered in field studies are 
inevitably finite which might introduce spurious properties in derived interaction 
networks. This issue aggravates in the light of a growing interest in time-resolved 
network analyses, where the length of time series has to be chosen small enough 
in order to allow for a high temporal resolution. Thus we will study possible influ- 
ences of the length of time series on properties of interaction networks. Second, the 
dynamics of subsystems may act on different time scales which might, in addition, 
also change over time. Depending on the time scales captured by the recording, 
typical estimators of signal interdependence might show a varying limited reliabil- 
ity, which in turn might affect properties of interaction networks. Assessing time 
scales in the data can be achieved, for example, in the time domain (auto-correla- 
tion function) or in the frequency domain (power spectral density estimates)0. Here 
we choose the latter. 

This chapter is organized as follows: the first part (section 15.1} is devoted to the 
theoretical and numerical study of widely used network characteristics (clustering 
coefficient, average shortest path length, assortativity coefficient, degree distribu- 
tion, edge density, connectedness) in dependence on the length and on the spectral 

^ Both are closely interrelated by the Wiener-Khinchin theorem. 
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contenlB of time series. In the light of interaction networks being frequently re- 
ported in field studies to possess a small-world topology and, if assessed, to be 
assortative, we pay special attention to these aspects in our simulation studies. We 
introduce a model which allows us to generate time series from which we derive 
interaction networks. In this model, we implement the null hypothesis that time se- 
ries are observed from independent stochastic processes. Interaction networks are 
derived by thresholding values of estimators of signal interdependence (absolute 
value of the correlation coefficient and the maximum cross correlation; see sec- 
tion I2.2.1[) . In order to facilitate the presentation of results and to keep the model 
as simple as possible, we assume all time series from which an interaction net- 
work is derived to possess the same number of sample points (a requirement met 
in most studies) and, on average, the same frequency content. The last require- 
ment, which we call homogeneity assumption, will be relaxed in the second part of 
this chapter (section [5.2[) . There we study, in a time-resolved manner, multichannel 
electroencephalographic recordings of 100 epileptic seizures, which are known for 
their complex spatial and temporal dynamics. We investigate whether dependen- 
cies identified in the simulation studies can also be observed in empirical data. In 
addition, we propose a framework for generating random networks tailored to the 
way how interaction networks are derived from multivariate time series. Using this 
approach, we demonstrate how properties of the interdependence structure related 
to the dynamics can be distinguished from those spuriously induced by the finite 
length of time series and their frequency content. We end this chapter (section 15. 3|) 
with a brief summary and discussion of results. 



5.1 Simulation studies 

We study networks derived from random time series of adjustable length T and 
with adjustable spectral contents. Let Zf, i G {1, . . .,N}, be time series whose en- 
tries Zi{t) are independently drawn from a uniform probability distribution U on 
the interval (0, 1). Choosing different values of T and inferring networks from mul- 
tivariate time series enables us to study the influence of the length of time series 
on properties of interaction networks. To study the influence of different spectral 
contents of time series on properties of derived interaction networks, we add the 

^ The spectral content of a time series is determined by power spectral density estimates. We will 
use the notions spectral content and frequency content interchangeably in the following. 
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possibility to low-pass filter the time series and define 

t+M-l 

x,M,r(0:=M-i E z,(Z), 2,(1) ^U^ (5.1) 
i=t 

where 1 < M <^ T and t G {l...,T}. M denotes the size of the moving average 
which controls the frequency content of the time series. Choosing large values of 
M results in time series with a high relative amount of power in low frequencies. 
Note that Xf i 7(f) = Zi{t)\/t, and that Xj^^j ^rid Xj^M,T independent for i 7^ ; 
by construction. When considering a particular realization r out of a total of R 
realizations of time series, it is denoted as x^^i t'^ ^ {1/ • • • / 

For all pairs of time series Xi^M,T arid Xj^M,T> signal interdependencies are esti- 
mated by determining either the absolute value of the correlation coefficient p^j or 
the maximum value of the absolute cross correlation (see section I2.2.1[) . We de- 
rive interaction networks from matrices p'^ or by thresholding with predefined 
edge density e (cf. section 12.2. 2|) . 

Most simulation studies we carry out follow a similar scheme: first, we study 
the influence of T on network properties by considering time series 1 x for dif- 
ferent T. Second, in order to study the influence of different spectral contents on 
network properties, we consider time series X;yvi,T' with T' = 500. We choose this 
value of T' because we want to investigate time series of short length as frequently 
considered in field studies. In both cases, we determine estimates of network prop- 
erties by calculating the average value of the considered network property obtained 
in R realizations of interaction networks. These networks are derived from R re- 
alizations of Xi^Mj foi" fixed values of e, M, T, and network size N. The obtained 
estimates are denoted by a hat-symbol and may depend on the chosen parameters, 
e.g., L{e, M,T). We omit the notation of the network size N because we choose 
N = 100 for all but one simulation study in the following. To keep the presentation 
of results concise and clear, we focus on results obtained using p'r- and only report 
results obtained using pf^ if these results are qualitatively different. 

5.1.1 Impact on clustering coefficient and edge density 

This section is organized in three parts. First, we study the influence of the length T 
of time series on the edge density e and clustering coefficient C of derived interac- 
tion networks. Second, we investigate a potential influence of the frequency content 
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of time series on the aforementioned network properties. Third, in the light of the 
findings obtained in the previous two parts, we trace back observed dependencies 
of network properties to properties of the time series generated by our model. 



Since the time series defined by equation (|5.1|) are independent, the question 



arises whether a derived interaction network, which is supposed to reflect interde- 
pendencies between time series, does possess any edges. To gain some intuition, 
we consider R realizations of two time series Xi''^j and x^''^^, r G {1, . . ■ ,R}, i 7^ /. 
To simplify notation, let 

Pii,\,T ■' 



( {r) (r) ^ 



denote the absolute value of the empirical correlation coefficient obtained for time 
series x^^^j and x'^^'^j. Since x^^^j and x^^^j are independent and the correlation coef- 
ficient is symmetric, values of the correlation coefficient will be distributed around 
the mean value 0. The variance of this distribution will be higher the lower we 
choose the length T of time series. Let us randomly pick one value p\-'.^ j out of the 

R values. Since almost surely p-j ^ > 0, there are thresholds 9 with < 9 < p-- ^ j 
for which we would establish an edge. Applying this argument to a number N 
of time series, we can find a threshold for which the resulting network possesses 
edges and, as a result, e > 0. Moreover, for a fixed value of > 0, we expect e to be 
larger the lower we choose T. For a constant value of e, we hypothesize that 9 will 
be higher the lower T. 

To explore this hypothesis, we derive an approximation e^i for the edge density 
by taking the asymptotic limit (T — > 00, see section [73] Lemma 2 for details), 

e^i{9,T)=2^{-VT9), (5.3) 

where denotes the cumulative distribution function of a standard normal dis- 
tribution. The dependence of e^i ori 9 is shown in the top left panel of figure 15.11 
for selected values of T. As hypothesized, the edge density indeed decreases for 
increasing 9 (while keeping T constant) and, for a constant value of 9, the edge 
density is higher the lower T. 

Since we took the asymptotic limit, the validity of equation (|5.3|) might be limited 
to the case of large values of T. Thus we numerically study the dependence of 
the edge density on 9 for small values of T, which are relevant in field studies. 
Consider R = 10^ values of Pi2M T obtained from R realizations of two time series 
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Figure 5.1: Top row: Dependence of edge density e{6,M, T) (left) and of clustering 
coefficient C{9,M,T) (right) on the threshold 9 for different values of the size M 
of the moving average and of the length T of time series. Values of edge density 
CaiiO, T) obtained by taking the asymptotic limit (equation (|5.3[) ) are shown as lines 
(top left). Bottom left: Dependence of the ratio 7(e,M, T) = Cm,t{£) /C^Rie) on 
edge density e. Note, that we omitted values of estimated quantities obtained for 

6 e {6 : (R-i Lr m,t(^) ^13 mA^)) < ^^~^} ^^^^^ ^he accuracy of the statistics 
is no longer guaranteed. Bottom right: Dependence of effective length Tgff as deter- 
mined by equation (|5.10)) (black line) and its numerical estimate T^ff (red markers) 
on M. 
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4,M T' ^ ^ re {1, . . . , K}. We estimate the edge density e{e,M, T) by 

mM,T) := K-^E^Sm,t(^)' (5-4) 

r 

where 

'f"'^^' (5.5) 
10 ,else. 

e{9,M, T) is the numerically determined probability that there is an edge between 
two nodes given 9, M, and T. We mention that e{9,M, T) does not depend on N. 
As shown in the top left panel of figure I5.1[ £{9, 1, T) matches well £^1(0, T) except 
for small values of T (T < 30). 

We continue by studying the clustering coefficient for our model networks. For 
a chosen length of time series, we expect to observe the clustering coefficient to 
decrease with increasing the threshold because the edge density becomes smaller. 

(r) 

Consider R realizations of three time series x- l^j, i G {1, 2, 3}, r G {1, . . . , R). We 
estimate the clustering coefficient by 

C{9,M,T)-= '-^^^ '-^j^^ — • (5.6) 

Indeed, for a constant value of T, the top right panel of figure 15.11 shows that the 
clustering coefficient C{9,1,T) is decreasing in 9. For constant values of 9, we ob- 
serve C{9, 1, r) to be higher the lower T. 

Comparing the clustering coefficient C{9,M, T) of our model networks with the 
clustering coefficient CER(e) obtained for Erdos-Renyi networks requires our esti- 
mate in equation (|5.6)) to be rewritten. Using equation (|5.4)) , we define 

CM,T{e) := C{9{e,M,T),M,T) (5.7) 

with 

9{e,M,T) := mi{9 : e{9,M,T) > e}. (5.8) 

This enables us to determine the ratio 7(e, M, T) := CM,T{e) /C^^{e) (cf. section 
I2.1.2[) . We observe 7(e, 1, T) to be higher the lower e and T (lower left panel of fig- 
ure |5Jl)- For a range of values of T and e , 7(e, 1, T) ^ 1. These findings suggest that 
there is a relevant dependence between the three random variables Pij,M,T> Pii,M,T' 



64 



5 Influence of temporal sampling 



and PjiM,T for small values of T and different indices i, j, and I. This dependence 
vanishes for T ^ oo and constant edge density, and C converges to Cer II232II . 

To investigate the influence of the spectral content of time series on the edge 
density and the clustering coefficient, we repeat the steps of analysis using time 
series Xj for which we keep T' = 500 constant and choose different values 
of M. The findings shown in figure 15.11 (top panels, lower left panel) demonstrate 
that the higher the amount of low frequency contributions in the time series (large 
values of M) the higher e{e,M,T') and C{e,M,T') (for constant 6 > 0), and the 
higher 7(e, M, T') (for constant e <^ 1). We observe 7(e, M, T') ^ 1 which is higher 
the smaller e and the higher M, underlining the difference between our networks 
and Erdos-Renyi networks. 

Summarizing the findings obtained so far, the similar dependence of e, C, and 7 
on T and M becomes apparent. We hypothesize that this similarity can be traced 
back to properties of time series, and, more specifically, to similar variances of Pij,i,T 
and Pij^M,V- We aim at determining a value of T = Tgff, the effective length of time 
series, which leads to Var(|0,y 1 7^^^) Var{pij^j^j/). By using the asymptotic variance 
of the limit distributions of T — > 00 (see section [Z3l Lemma 1 for details), we obtain 

Var(^,^- m,t) ~ g{M)Var{pij,ij), with g{M) = + (5.9) 
which allows us to define the effective length of time series, 

reff(M) is shown in the lower right panel of figure 15.11 and is decreasing in M. 
Since equation (|5.10)) was obtained by exploiting the asymptotic limit (T — ^ 00), we 
numerically study the case of small values of T as follows: we determine C{6, 1, T) 
for different values of 9 (like before) and for T G {3, . . . , T'}. In addition, for some 
chosen values of M, we determine C(9, M,T'). Finally, for each value of M, we 
determine a value T for which C{6,1,T) and C{6,M,T') curves best match in a 
least-squares sense. This value of T which is denoted as fgff is shown in figure 15.11 
(lower right panel). Indeed, Teff ^rid Tgff are in good agreement with a maximum 
deviation of \ f^ii — Tgffj ^ 2. Thus, equation (|5.10|) seems to hold also for small 



length T of time series. In figure 15. 1[ values of M and T for quantities e, C, and 
7 have been chosen according to equation (|5.10)) . Our above-mentioned hypothesis 
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is supported by the remarkable similarity between dependencies of e and C on 9, 
and 7 on e for pairs of values (M, T') and those dependencies obtained for pairs of 
values (1, Tgff). 

In summary, the clustering coefficient of networks derived from random time se- 
ries with a large amount of low frequency contributions or with a small number 
of sample points is higher than the one obtained for corresponding Erdos-Renyi 
networks — independently of the network size (cf. equation (|5.6|) ). We observed this 
difference to become more pronounced for lower edge densities, lower length of 
time series, or, likewise, for a larger amount of low frequency contributions. These 
findings reveal fundamentally different properties on the level of the network con- 
struction: in Erdos-Renyi networks, each possible edge is (1) equally likely and (2) 
independently chosen to become an edge. While property (1) is fulfilled in our 
model networks, property (2) is not, which becomes apparent in the clustering co- 
efficients differing from those of Erdos-Renyi networks. 

5.1.2 Impact on average shortest path length 

To investigate the influence of the length and frequency content of time series on 
the average shortest path length of derived networks, we pursue a similar but dif- 
ferent simulation approach. Consider an ensemble of K = 100 networks. Each net- 
work r (r G {1, . . . ,R)) possesses the same number N of nodes and is derived by 
thresholding p^^^j^j [i,] G {1, . . .,N}) using a fixed edge density. We set N = 100 
but also obtained qualitatively similar results for small network sizes (N = 50) as 
well as for larger network sizes (N = 500). Let L^''\e,M,T) denote the average 
shortest path length of network r derived from p^i^^j, and let ^-^'^(e) denote the 
average shortest path length obtained from the r-th Erdos-Renyi network of size N 
and edge density e. Mean values over realizations are denoted as L(e, M, T) and 
i'ER(e)/ respectively. In order to compare the average shortest path length of our 
networks with the ones obtained for corresponding ER networks, we determine 
A(e, M, T) := L{e,M,T) /L^^{e) (cf. section |2.1.2|) . As in the previous section, we 
consider L{e,M,T') {X{e,M,T')) for different values of M and fixed V = 500 as 
well as L(e, 1, T) (A(e, 1, T)) for different values of T and fixed M = 1. 

The dependence of L and A on e is shown in figure 15.21 for different values of T 
and M. L and A are decreasing in e since additional edges reduce the average short- 
est path length in our networks as well as in ER networks. Remarkably, we observe 
similar dependencies as in the previous section: L(e, 1, Tgff) ~ L(e, M, T') which in- 
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Figure 5.2: Dependence of the average shortest path length L(e,M, T) (left) and 
of the ratio A(e, M, T) = L{£,1,T) /L^^{e) (right) on edge density e for different 
values of the size M of the moving average and of the length T of time series. Lines 
are for eye-guidance only 

dicates that similar variances of the time series lead to similar average shortest path 
lengths in our model networks. Differences between our model networks and ER 
networks as characterized by A become more pronounced the smaller e, the smaller 
T, or the larger the amount of low frequency contributions (as parametrized by M). 
For typical edge densities reported in field studies (e ^ 0.1), these differences are 
not as pronounced (A < 1.2, cf. figure 15.21 right) as for the clustering coefficient 
(7 > 2 for selected values of M and T, cf. figure ISH bottom left). 

5.1.3 Impact on assortativity 

To assess the influence of the finite length and the spectral content of time se- 
ries on the assortativity of derived networks, we adopt the simulation scheme 
of the last section. Consider R = 1000 realizations of networks. Each network 
r G {1, . . .,J^} possesses N = 100 nodes and is derived by thresholding the val- 
ues J, z,y G {1, . . . , N}, such that the network has a prespecified edge density 
e. Let a^''\e, M,T) denote the numerically determined assortativity coefficient of 
network r. We determine d{e,M, T) by averaging over the values obtained for the 
realizations, a{e, M, T) = R'^ ^^"^ {e, M, T). To assess the influence of the spectral 
content of time series on the assortativity coefficient, we determine d{e, M, T') for a 
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Figure 5.3: Left panel: Dependence of the assortativity coefficient a{e,M, T) on the 
edge density e for different values of the size M of the moving average and of 
the length T of time series. Right panel: Dependence of the assortativity coefficient 
a{e, M, 500) and a{e, 1, Tgff (M)) on the size M of the moving average for a selected 
value of e = 0.1. Lines are for eye-guidance only 



fixed value of T' = 500 but different values of M and e. On the other hand, in order 
to explore a potential influence of the finite length of time series on the assortativ- 
ity coefficient, we determine d{e, 1, T) for a fixed value of M = 1 but for different 
values of T and e. Finally we mention that values of T and M are chosen according 
to equation (|5.10|l such that for each value of M we obtain a corresponding value of 
T = T,ii{M). 

In figure l53l (left panel), we show the dependence a{e,M, V) for selected values 
of M and the dependence of fl(e, 1, T) for selected values of T on e. For constant val- 
ues of e, we observe the assortativity coefficient to be higher the larger the amount 
of low frequency components (larger values of M) or the smaller the length of time 
series, a approaches values around as e increases. Remarkably, for a range of val- 
ues of e, M, and T, the assortativity coefficient clearly indicates our networks to be 
assortative. Values of a (e, 1,500) are slightly smaller than zero indicating a slight 
dissortative configuration of the networks. This dissortative configuration is also 
reflected in the assortativity coefficient aer of corresponding Erdos-Renyi networks 
(flER(e) ~ 1,500), data not shown) and is related to the finite size of studied 
networks [ll: we observed a (e, 1,500) (as well as flER(e)) to further decrease in the 
negative regime for smaller network sizes, N <^ 100, and to approach the value 
for higher values of N. 
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Figure |53] (left panel) also reveals that d{£, M, T') and d{e, 1, T) are approximately 
equal for large values of T = Tgff and small M but start to diverge for larger values 
of M and smaller values of T. To gain more insight into this issue, we show in the 
right panel of figure l53l the dependence of d{e, M, T') and d{e, 1, Tgff(M)) on M for 
a fixed value of e = 0.1. We observe that d{e,M, T) ^ d{e,\, Teii{M)) for M < 80 
and that both quantities become different for larger values of M or, equivalently, 
for smaller values of T. We suspect this finding to reflect that equation (|5.10[) , which 
has been derived for T — > oo, does not hold any more for very low length of time 
series. 



5.1.4 Impact on connectedness and degree distribution 

We continue with investigating the influence of the finite size and the frequency 
content of time series on the number of connected components Nc of interaction 
networks. As pointed out in section 12.1.11 Nc can affect the average shortest path 
length and determines the number of clusters if a cluster is defined as a connected 
component. Following the same steps as in the previous section, we derive R in- 
teraction networks from thresholding |ojjj^ j, i,] G {1, . . . , N}, r G {1, . . . , R}, N = 
100, R = 100 such that the networks possess a prespecified edge density e. We ob- 
tain Nc{£,M,T) as the average over the values Nc^\e, M,T) determined from the 
R interaction networks. In addition, for different values of e, we generate R Erdos- 
Renyi networks of size N = 100, and we determine Nc,er(£) as the average over 
^cEr(^) values. 

For different values of T and a fixed value of M = 1, the dependence of Nc(e, 1, T) 
on e is shown in the right panel of figure 15.41 Nc(e, 1, T) ~ 1 for all values of 
e considered here. This finding is in agreement with the number of connected 
components observed for corresponding ER networks, N^ERi^) ~ 1 for e > 0.05, 
which can be expected due to the connectivity condition In N/(N — 1) ~ 0.05 
(N = 100) which holds for ER networks (cf. section l2.1.2[) . The left panel of figure ISiH 



shows Nc{e, M, T') for different values of M and a fixed value of T' = 500. Remark- 
ably, for low edge densities (e < 0.25), the number of connected components is 
higher the larger the amount of low frequency contributions (as parametrized by 
M) indicating a stark difference between our networks and ER networks. In ad- 
dition, Nc{e, M,T') is larger than Nc(e, 1, reff(M)). This finding points towards a 
difference between our networks derived for different length of time series and 
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Figure 5.4: Dependence of the number of connected components Nc{e, M,T) on 
the edge density e for different values of the size M of the moving average (left, 
for T = 500) and of the length T of time series (right, for M = 1). Lines are for 
eye-guidance only. 

those derived for different frequency content of time series despite the variances of 
the underlying time series being approximately equal. 

We continue by numerically estimating the connectivity condition of our net- 
works, namely the minimum edge density e* or, equivalently, the minimum mean 
degree, k*, for which a network of a given size N is connected. For a given value of 
N, we determine the minimum mean degree k* of our networks as follows: consider 
time series x^^j j with R = 500 and i,] G {1, . . ■ ,N}. In a first step, we derive R 
networks from the time series using e = and we determine the fraction of the net- 
works which are connected (for e = this fraction will be zero). We repeat this step 
with an increased edge density (such that the derived networks possess one more 
edge than in the previous step) and again determine the fraction of the networks 
which are connected. The iteration is stopped as soon as the fraction reaches 95 %, 
and the edge density at this step is denoted as e*(M, T). e*(M, T) and k*(M, T) are 
determined by averaging the values obtained from 5 runs of this simulatioro . As in 
the previous sections, we choose different values of M and constant T = T' to study 
the influence of the frequency content as well as different values of T and constant 

^ The computation became feasible by exploiting the fact that the number of possible values of 
the edge density (or mean degree) is finite for finite networks. By making use of nested intervals, 
the minimum edge density or mean degree for which a network is connected was determined 
efficiently. 
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Figure 5.5: Dependence of the minimum mean degree k* (M, T) (left) and minimum 
edge density e* (M, T) (right) on the number of nodes N for different values of the 
size M of the moving average and of the length T of time series. 

M = 1 to investigate the influence of the length of time series on the connectedness 
of networks. In addition, we numerically determine the minimum edge density 
and minimum mean degree k'^^ of ER networks by following the same steps as for 
the calculation of e* (M, T) and k* (M, T) but with one difference: instead of deriv- 
ing networks from time series, we generate ER networks with prespecified numbers 
of edges. 

The dependence of e* (M, T) and k* (M, T) on N is shown in figure 15.51 Consid- 
ering the connectivity condition of ER networks, we expect the minimum degrees 
to take on higher values and the minimum edge density to take on lower values 
as N increases. Indeed, we observe e*(l, T) and k*{l,T) to agree well with the 
minimum edge density e^j^ and minimum mean degree k'^^ numerically obtained 
for ER networks, respectively (maximum differences: |e* (1,500) — e^j^l < 10~^, 
(1,500) — k^^l < 0.3). Just for short lengths of time series (T < 10), we observe 
slight differences between ER networks and our model networks in the minimum 
mean degree (cf. figure 153] left panel, \k*{l,7) - fcgj^j < 4.6). For M > 5, we ob- 
serve a strong deviation from e*{M,T') {k*{M,T') ) from e*(l,T) {k*{l,T)): for a 
given N, the minimum mean degree and the minimum edge density is higher the 
larger M. In addition, while the minimum mean degree for our networks derived 
for M = 1 and larger values of T appears to scale logarithmically with N (as does 
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Figure 5.6: (a-c) Degree distributions pk{€,M,T) estimated for R = 1000 realiza- 
tions of networks derived from time series Xi^Mj (N = 100) via thresholding using 
various edge densities e = k{N — 1)~^ and for selected values of the size M of the 
moving average and of the length T of time series. The symbol legend in (a) also 
holds for (b) and (c). (d) Dependence of correlation (kl(M)) between node degrees 
and spectral content in the lower frequency range on the size M of the moving 
average. Mean values of correlations obtained for R = 100 realizations of networks 
for each value of M are shown as crosses and standard deviations as error bars. 
Stars indicate significant differences in comparison to ^^(1) (Bonferroni corrected 
pair-wise Wilcoxon rank sum tests for equal medians, p < 0.01). Lines are for eye- 
guidance only. 
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the minimum mean degree for ER networks), the minimum mean degree of our 
networks derived from time series with a high amount of low frequency contribu- 
tions grows faster than In N. Taken together, larger edge densities (or, equivalently, 
mean degrees) than the ones for ER networks are necessary to assure connected- 
ness of networks derived from time series with a large amount of low frequency 
contributions. 

To gain a better understanding of the differences observed between networks 
derived from time series of small length and those obtained from time series with 
a large amount of low frequency components, we investigate degree probability 
distributions. We define the estimated probability of a node to possess a degree k 
as 

Pk := . (5.11) 

With p]^{e,M,T) we denote the estimated degree distribution for networks which 
are derived from x, ^ via thresholding with an edge density e. In figure 15.61 (a-c), 
we show estimated degree distributions obtained for different values of e (N = 100, 
R = 100) and, for comparison, different degree distributions of ER networks. We 
recall (cf . equation (|2.9[) in section I2.1.2[) that the degree distribution pk,N,ER of ER 



networks follows a Binomial distribution, 

PKnM^) = (^^ - ^)''"'"'- (5.12) 

As expected, the degree distributions shift towards higher values the larger e since 
k e. Remarkably, for different values of T but constant M = 1, we observe 
pi;{e,l,T) to coincide with the values Pk,N,ER obtained for corresponding ER net- 
works (within the errors to be expected due to the limited sample size). In contrast, 
for constant T' = 500 and different values of M > 1, we observe striking differences 
between pj^{e,M, T') and Pk,N, er- These differences become larger the higher M. In 
particular, the probability of nodes with zero degree (k = 0) increases for decreas- 
ing edge density and higher values of M. With the number of single nodes (each 
of which is considered as a connected component, cf . section 12. 1[) , the number of 
connected components observed in the networks increases. 

Given the results obtained so far, we hypothesize that differences in the degree 
distributions as well as in the number of connected components may be related 
to differences between the spectral content of time series ^ j, for M > 1, z G 
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{1, . . . , N}, N = 100. Specifically, a node t with a large degree kj might be associated 
with a time series x- ^ j, whose amount of low frequency contributions is larger 

(r) 

than most of the other time series x-j^j, j G {1, ...,N} \i. To investigate this 
hypothesis, we generate R realizations of time series x ■ ^ j, and determine their 
periodograms Pi^M^f), / ^ {0, . . - //Nyq}/ via Fourier transform 11233 II . /isjyq denotes 
the Nyquist frequency. We normalize all periodograms such that E/^^m(/) ~ ^■ 
From the same time series, we derive networks using e = 0.1 and determine the 
degree of nodes, kf^\ For some chosen value of /' G {0, . . . ,/Nyq}/ let us define 

/'-I /Nyq 

pI:m = L ^^f)' = L ^tl^f)' (5-13) 

/=0 /' 

where P]^'!^^ (^fu^) quantifies the total power in the lower (upper) frequency range. 
In addition, for each realization r, let 

= corr (fc(^), Pi^(^)) , k{;) = corr {k(^\ P^^'^) (5.14) 

denote the empirical correlation coefficients between the degrees and the corre- 
sponding total amount of power in the lower and upper frequency range, respec- 
tively. We determine mean values over realizations by kl(M) = i^~^lZr'^L^^ ^^'^ 
Ku(M) = R~'^Er^lj^- Note that kl(M) = -ku(M) by construction. /' = f'{M) 
is chosen such that 40% of the total power of the filter function associated with 
the moving average II233II is contained in the frequency range [0,/']. We mention 
that the exact choice of /' does not qualitatively change our results as long as 
< /' < /Nyq holds. 

In figure 15.61 (d), we show the empirical correlation between the degrees and 
the amount of low frequency contributions, kl(M), for different values of M. For 
M = 1, we do not observe a significant correlation, i.e., kl(1) ~ 0. For M > 1, 
however, the degrees of nodes are higher the larger (the lower) the amount of low 
(high) frequency contributions. This correlation becomes stronger for larger M. This 
finding supports our hypothesis that differences in the degree distributions can in- 
deed be related to different spectral contents of time series. In addition, considering 
the degree of a node as a way to quantify the centrality Il6l l84ll234| which is a lo- 
cal property of a network, our results highlight how univariate properties of time 
series (spectral content) may be reflected in local properties of networks (degree). 
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Based on the simulation studies, four main conclusions can be drawn. First, the 
clustering coefficient of our networks derived from independent random time series 
is typically larger than those of corresponding ER networks. The clustering coeffi- 
cient is higher the larger the amount of low frequency contributions, the smaller 
the length of time series, and the smaller the edge density (cf. figure \5?]} . Second, 
like the clustering coefficient, the average shortest path length of our networks is 
larger the higher the amount of low frequency contributions, and the smaller the 
length of time series (cf. figure I5.2[) . We mention that the average shortest path 
length as defined in equation (|2.6[) depends non-trivially on the amount of low-fre- 
quency contributions: with the amount of low frequency contributions, the number 
of connected components increases (cf . figure \5A} , Nc — > N, which in turn leads 
to L ^ 0. Since, for small edge densities, the clustering coefficient deviates more 
strongly from those of ER networks (7 > 2) than the average shortest path length 
(A < 1.2), our networks would be characterized as small- world networks (cf. sec- 
tion 12.1.21 and chapter H]). Third, our networks become more assortative the higher 
the amount of low frequency contributions, the smaller the length of time series, 
and the smaller the edge density (cf. figure I5.3[) . Nodes with a high (low) degree 
are preferentially linked to nodes with a high (low) degree. Thus, taking into ac- 
count that our networks are derived from random time series, our networks show 
degree-degree correlations (see section I2.1.1|) as opposed to ER or generalized ran- 
dom graphs representing uncorrelated random networks. Fourth, we observed the 
amount of low-frequency contributions as well as of the length of time series to 
have a similar influence on the clustering coefficient, average shortest path length, 
and on the assortativity coefficient. Differences can be observed, however, in the 
number of connected components, in the connectivity condition, and in the degree 
distributions. These properties are equal (within the errors of the simulation) to the 
ones of ER networks for our networks with M = 1 but different length of time 
series. In contrast, increasing the amount of low-frequency contributions leads to a 
higher number of connected components than ER networks and to degree distribu- 
tions and connectivity conditions deviating strongly from those of ER networks. 

5.2 Field data analysis 

Spatial and temporal changes in frequency content can typically be observed in 
field data reflecting the dynamics of complex systems. As a prototypical example 
well known for its notoriously complex changes in frequency content 11190141931 , we 
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here analyze electroencephalographic (EEG) recordings of epileptic seizures. The 
aim of this section is threefold: first, we study whether the influences illustrated in 
the simulation studies can also be observed in field data. We restrict the time-re- 
solved analysis to network properties often assessed in field studies, namely to the 
clustering coefficient, the average shortest path length, and the assortativity coeffi- 
cient. In addition, we focus on the influence of the spectral content of time series 
on network properties. Second, the model used throughout the simulation studies 
assumes time series to possess, on average, the same frequency content (homogene- 
ity assumption). This assumption is usually not fulfilled in field studies where the 
spectral content of time series recorded from different parts of the system may differ 
substantially. We investigate whether findings observed in the simulation studies 
carry over to field studies where time series possess different spectral contents. For 
this purpose, we define two ensembles of random networks which are generated in 
a data-driven way mimicking the empirical time series in different degrees of de- 
tails. Third, we depict a methodological framework which can help to distinguish 
network properties of interdependence structure reflecting the dynamics of a com- 
plex system from those structures spuriously induced by the applied methods of 
analysis. 



5.2.1 Description of data and steps of analysis 

We analyze multichannel EEG recordings from 60 patients f| capturing 100 epileptic 
seizures reported in references Il59ll235l . During presurgical evaluation of drug-re- 
sistant epilepsy, the data were recorded from the cortex and other relevant struc- 
tures of the brain using implanted strip, grid, or depth electrodes (N = 53 ± 21 
channels). The EEG data were sampled at 200 Hz within the frequency band 0.5- 
70 Hz using a 16-bit analog-to-digital converter. Electroencephalographic seizure 
onset and end were detected automatically II235L For each channel and recording, 
the data were divided into consecutive, non-overlapping windows of 2.5 s duration 
(T = 500 sampling points). Time series of each window were normalized to zero 
mean and unit variance for each channel separately. 

We derive networks by thresholding values of estimators of signal interdepen- 
dence (using e = 0.1) as in the previous section. In order to study whether the 
influences identified in the simulation studies depend on the chosen estimator of 



^ All patients had signed informed consent that their clinical data might be used and published 
for research purposes. 
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signal interdependence when analyzing field data, we use the absolute value of the 
correlation coefficient and the maximum value of the absolute cross correlation 
(cf. section 12.2.1 [) ■ Characteristics of networks based on p*^ or are denoted as 
Cc, Lc, flc or Cm, Lm, flm/ respectively. We omit the notation of the window index in 
order to facilitate the presentation of results. 

To assess time-resolved network characteristics of all 100 epileptic seizures, we 
determine averages of network characteristics as follows: since seizures vary in 
length (mean seizure duration: 110 ± 60s), we normalize seizure durations by par- 
titioning each seizure in 10 equidistant time bins (similar to reference [59]). Thus, 
each data window and its associated network characteristic within a seizure is as- 
signed to a time bin. In addition, we define a pre-seizure and a post-seizure time 
bin which both contain the same number of data windows. Time-resolved network 
characteristics of all 100 epileptic seizures are obtained by averaging over the re- 
spective network characteristics contained in a time bin. We denote the quantities 
obtained this way as Q, Lc, Uc or as Cm, Lm, ^m- 

We study the influence of the spectral content of time series on network prop- 
erties by comparing their values to those obtained for two ensembles of random 
networks. Networks of both ensembles are based on random time series which 
mimic properties of the EEC time series at two different levels of detail. The first 
random network ensemble is based on random time series with a spectral con- 
tent which is approximately equal to the mean spectral content of EEC time series 
within a window. Thus, the construction resembles the one used in our model stud- 
ies but allows to incorporate spectral contents that are found in empirical data. For 
a given patient, consider a window and let N denote the number of time series 
contained in this window. The periodogram P;(/) is estimated for each time se- 
ries i, and the mean power spectral density is determined, P{f) = N~^X^, P/(/). 
We generate N random time series of length T = 500 whose entries are drawn 
from the uniform probability distribution U (see section |53]>- Each of these random 
time series is filtered in the Fourier domain using \JP{f) as filter function, and 
we normalize the filtered time series to zero mean and unit variance. From these 
time series, we derive a network based on p^ or p"" using e = 0.1 and determine the 
network characteristics (clustering coefficient, average shortest path length, assorta- 
tivity coefficient). In total, 20 realizations of the network are generated and network 
characteristics are determined. The average of the respective network characteristic 
over the 20 realizations is denoted as L^c \ ^^c \ or as Cm , Lm , i^nx ■ This way, 
we determine network characteristics for each window and each patient. 
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Figure 5.7: (Left) Relative amount of power contained in the 5- (Pj, black), d- (P^, 
blue), a- (Pa, green), and /3- (P^, red) frequency bands during an exemplary seizure 
(N = 66). Profiles are smoothed using a four-point moving average. Grey-shaded 
area marks the seizure. (Right) Mean values (Pj, P^, Pa, P^) of the relative amount 
of power averaged separately for pre-seizure, discretized seizure, and post-seizure 
time periods of 100 epileptic seizures. Lines are for eye-guidance only. 



With the second random network ensemble, we take into consideration that the 
spectral content of LEG time series capturing signals from different brain regions 
may differ considerably. Networks of this ensemble are derived from univariate 
time series surrogates 11201112361 that are random but possess power spectra and 
amplitude distributions which are practically indistinguishable from those of the 
EEG time series: to generate a surrogate, amplitudes of an EEG time series are 
iteratively permuted while the power spectrum is approximately preserved. This 
randomization scheme is known to destroy any significant linear or non-linear 
dependencies between time series and has been frequently used to test the null 
hypothesis of independent linear stochastic processes. For each patient and each 
window, we generate 20 realization of random networks {e = 0.1) and determine 
their network characteristics. The mean of the respective network characteristics is 

, . , ^(2) ,(2) (2) ^(2) .(2) (2) 

denoted as Q , Lc , , or as Cm , i-m / ■ 



5.2.2 Spectral contents of data 

To gain insight into a possible influence of the spectral content of time series on net- 
work properties, we characterize the time-dependent spectral content of the EEG 
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recordings. The relative amount of power contained in the S- (0-4 Hz, Ps), d- (4- 
8 Hz, P^), a- (8-12 Hz, P^.), and /3- (12-20 Hz, P^) frequency bands is determined 
from P(/) (cf. section |5.2.1[) for each patient and each data window. For an exem- 
plary recording of a seizure, we show in figure 15.71 (left) the temporal evolution 
of the relative amount of power in different frequency bands. Prior to the seizure, 
more than 50 % of the total power is contained in the ^-band, i.e., in low frequen- 
cies. This amount is nearly halved during the seizure while the relative amount of 
power in higher frequency-bands is enlarged compared to the pre-seizure time in- 
terval. At seizure end, the total power is shifted back towards low frequencies, and 
we observe Ps to be even higher than prior to the seizure. The mean values of the 
relative amount of power {Ps, P^, Pa, P^) obtained for all seizure recordings shown 
in figure 15.71 (right) support this finding: we observe a shift of the total power from 
low frequencies prior to seizures towards higher frequencies during seizures and 
back towards low frequencies after seizures. 

5.2.3 Clustering coefficient and average shortest path length 

Figure 15.81 shows the temporal evolution of the clustering coefficient and the aver- 
age shortest path length based on p'^ (top panels) or p"^ (bottom panels) obtained 
for an exemplary recording of a seizure. During the seizure, network characteris- 
tics Cc, Cm as well as and show pronounced differences when compared to 
the network characteristics obtained from both random network ensembles. These 
differences are smaller prior to and after the seizure, and they nearly vanish for Cm 
and Cm^ as well as for Lm and L^^. Cc^-* and Cm"* decrease during the seizure and 
increase already prior to seizure end where they remain at an elevated level com- 
pared to the pre-seizure period. These changes resemble the temporal evolution of 
the relative amount of power in the ^-band, Ps (cf. left panel of figure 15. 7[) . This 
similarity corroborates the results obtained in our simulation studies, namely that 
the clustering coefficient of our random networks is higher the larger the amount 
of low frequency contributions in the time series. Findings obtained in the simula- 
tion studies also indicate that the average shortest path length is influenced by the 
frequency contents of time series to a lesser extent than the clustering coefficient. 
This result is also supported by ' and Lm which both vary little over time. Only 
after the seizure, L^^-* is slightly increased and reflects the high amount of power in 
the (5-band. 
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Figure 5.8: Network properties Cc and Lc (top row, black lines) as well as Cm and 
(bottom row, black lines) during an exemplary seizure (cf. figure 15.71 (left)). Mean 
values and standard deviations of network properties obtained from surrogate time 



series (C, 



(2) 



r(2) Al) 



(2) 

Lfn ) are shown as blue lines and blue shaded areas, respec- 



tively, and mean values and standard deviations of network properties obtained 

from the overall spectral content model {C^c \ L^c \ , L^) are shown as red lines 
and red shaded areas, respectively. Profiles are smoothed using a four-point moving 
average. The grey-shaded area marks the seizure. For corresponding Erdos-Renyi 
networks, Cer ~ 0.1 and Ler ~ 2.4 for all time windows. 
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The clustering coefficients obtained from the two random network ensembles, 
Cc^"* and differ only slightly from each other. The same can be observed for 
the average shortest path length L^^ and The slight differences appear to be 
systematic, which is reflected in Q <Cc and >Lc for many windows. This 
suggests that both random network ensembles are equally suited for characteriz- 
ing the influence of the amount of low-frequency contributions on the clustering 
coefficient and on the average shortest path length if interaction networks are de- 
rived from p^. In contrast, we observe differences between both random network 
ensembles in clustering coefficient and average shortest path length if network con- 
struction is based on p"^. The differences between and Cm'* as well as between 
L^^ and are most pronounced during the seizure and for L^^ and also after 
the seizure. These findings indicate that clustering coefficient and average shortest 
path length of networks based on p"^ intricately depend on the spectral content of 
individual EEG time series recorded from different brain regions. For these inter- 
action networks, the second random network ensemble accounting for the complex 
changes in spectral contents of different brain regions appears to be more suited to 
characterize the influence of low-frequency contributions on clustering coefficient 
and average shortest path length. 

The temporal evolution of mean values of C and L over all seizures is shown 

-(I) -(2) - (1) -(2) -(1) - (1) 

in figure |5^ Network characteristics Q , Q , , , , and decrease 
during seizures and increase already prior to seizure end, which roughly reflects 
the temporal changes of the relative amount of power in the ^-band, Ps (cf. right 

-(I) -(2) 

panel of figure |5.7|| . As in the case of the exemplary seizure recording, and 

- (1) - (2) 

as well as and follow similar courses in time which appear to be system- 
atically shifted along the ordinate. We observe differences between both random 
network ensembles for characteristics of interaction networks based on p^, namely 
for Cm"* and Cm'* as well as for and . These findings are in agreement with 
the ones obtained for the exemplary recording of a seizure. This indicates that in- 
deed the clustering coefficient and the average shortest path length of interaction 
networks based on p^ depend more sensitively on the spectral contents of indi- 
vidual EEG time series recorded from different brain regions than the respective 
quantities derived from p'^. 

The courses in time of and Lm resemble each other showing an increase during 
seizures and a decrease at seizure end. In contrast, while Cc and Cm increase at the 
beginning of the seizures. Cm decreases at the end of the seizures, where the average 
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Figure 5.9: Mean values (black) of network properties Q (top left), Lc (top right). Cm 
(bottom left), and Lm (bottom right) averaged separately for pre-seizure, discretized 
seizure, and post-seizure time periods of 100 epileptic seizures. Mean values of 
corresponding network properties obtained from the first and the second ensemble 
of random networks are shown as red and blue lines, respectively. All error bars 
indicate standard error of the mean. Lines are for eye-guidance only. 
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Figure 5.10: Mean values of Cq/Cq and Cm/Cfj{ (left) as well as Lq/V^q and 

(2) 

Lm / (^ight) averaged separately for pre-seizure, discretized seizure, and post- 
seizure time periods of 100 epileptic seizures. All error bars indicate standard error 
of the mean. Lines are for eye-guidance only. 



amount of power in low-frequencies is large, and Cc stays at an elevated level. The 
corresponding quantities obtained from the second random network ensemble for 

-12) 

networks based on p'^ and p"^ also show a different behaviour: while Cjn does not 
increase at the end of the seizures but fluctuates around 0.3 ± 0.01, Q increases 
at the end of the seizures and traverses an interval of values roughly three times 
larger than the interval containing values of ■ All in all, these findings suggest 
that indeed the values of the clustering coefficient and of the average shortest path 
length are influenced by the pronounced changes of the spectral content of EEG 
time series observed during epileptic seizures. 

We continue by comparing values of the clustering coefficient and average short- 
est path length with those obtained for our random networks. In the case of Erdos- 
Renyi networks, such a comparison is often realized in various studies by calculat- 
ing the ratio of the value of the network characteristics to the value obtained for 
corresponding ER networks. Since clustering coefficient and average shortest path 
length of ER networks do not change over time (for constant edge density), such 
a comparison just rescales the quantities by a constant factor and thus only shifts 
the curves shown in figure |5^ along the ordinate. We take into account the varying 
frequency content of time series and calculate the ratios of the clustering coefficient 
and the average shortest path length to their corresponding values obtained from 
the second random network ensemble. These normalized quantities are shown in 
figure 15.101 and describe a concave-like movement over time which indicates a re- 
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Figure 5.11: Top row: Assortativity coefficients Uc and flm (black lines) during an 
exemplary seizure (cf. figure |52l (left)). Mean values and standard deviations of net- 

(2) (2) 

work properties obtained from surrogate time series {a)- ' , a)^ ) are shown as blue 
lines and blue shaded areas, respectively, and mean values and standard deviations 

of network properties obtained from the overall spectral content model {a^c \ 
are shown as red lines and red shaded areas, respectively Profiles are smoothed 
using a four-point moving average. The grey-shaded area marks the seizure. For 
corresponding Erdos-Renyi networks, flgR = —0.04 ± 0.02 for all time windows. 
Bottom row: Mean values (black) of network properties Uc (left), flm (right) aver- 
aged separately for pre-seizure, discretized seizure, and post-seizure time periods 
of 100 epileptic seizures. Mean values of corresponding network properties ob- 
tained from the first and the second ensemble of random networks are shown as 
red and blue lines, respectively. All error bars indicate standard errors of the mean. 
For corresponding ER networks, flgR ~ —0.06 ± 0.01 for all time bins. Lines are for 
eye-guidance only. 
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configuration of networks: From more random topologies before seizures towards 
more regular (during seizures) and back towards more random network topolo- 
gies. Our findings thus support results reported in an earlier study [59] in which a 
different and seldom used thresholding method was employed. 



5.2.4 Assortativity 

For an exemplary recording of a seizure, the temporal evolution of the assortativity 
coefficient of interaction networks based on p'^ and is shown in the top panels 
of figure 15.111 Compared to the clustering coefficient and the average shortest path 
length (cf . figure 15. 8|) , Uc and appear to fluctuate stronger during the recording. 
We observe flm — and to a lesser extent Uc — to be increased during the seizure and 
to take on lower values before and after the seizure. The assortativity coefficient 
derived from the first random network ensemble, a'^^ slightly increases at the end 
of the seizure, reflecting the increased amount of low frequency contributions in 
the time series. In contrast, we do not observe such a behaviour for which 
fluctuates around some value during the recording. Remarkably, the assortativity 
coefficient derived from the second random network ensemble, Am , closely follows 

(2) (2) 

flm after the end of the seizure, which is similar to the behaviour of Cm and 
with respect to Cm and Lm (see figure I5.8[) . 

The bottom panels of figure 15.111 show the mean values of assortativity coeffi- 
cients obtained for all 100 seizures. The average values reveal structures which are 
partially hidden by fluctuations observed on the level of individual seizure record- 
ings: flc and flm are increased during seizures and show lower values before and 

-(1) 

after the seizures. Concerning the first random network ensemble, we observe 
and flm to roughly reflect the course in time of the relative amount of power in 
the ^-band (cf. figure 15. 7|) , which can be expected due to the findings obtained in 

(2) (1) 

the simulation studies (cf. figure |5.3[) . flc and take on similar values over time, 
and both increase at the end of the seizures. In contrast, the temporal evolution of 
flm'' differs from , which indicates that the assortativity coefficient based on p"^ 
depends sensitively on the different spectral contents of EEC time series recorded 
from different brain regions. 

We are not aware of a common way agreed upon in the literature to compare the 
values of the assortativity coefficient with those obtained from random networks. 
Determining the ratio a/a^^^ appears to be not well suited for values defined on 
the interval [—1, 1] which can, in addition, fluctuate around zero (as is the case for 
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Figure 5.12: Difference values a^^^ and a[^^ for pre-seizure, discretized seizure, and 
post-seizure time periods of 100 epileptic seizures. All error bars indicate standard 
error of the mean. Lines are for eye-guidance only. 



.(2) 



flc ' and flm )• Here we refrain from developing a sophisticated method allowing 
for a comparison between assortativity indices but instead define a tentative index, 
namely the difference 



-(D) 



and 



-(D) 



(5.15) 



These quantities are shown in figure I5l2l a[^^ and fl^ have a similar course in 
time indicating a gradual increase of the assortativity during the seizures and a 
sudden decrease at the end of the seizures. This indicates that the interaction net- 
works during seizures display topologies which are more assortative than the ones 
obtained before and after the seizures. 



5.3 Discussion 

In this chapter, we studied the influence of the finite length and the frequency con- 
tent of time series on properties of derived interaction networks. The network ap- 
proach to multivariate time series analysis assumes the studied dynamics to be well 
represented by a model of mutual relationships (i.e., a network), in which edges 
reflect interactions between subsystems (nodes). We studied interaction networks 
derived from time series of independent processes, which would not advocate the 
representation by a model of mutual relationships. Remarkably, these networks dis- 
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played non-trivial topologies which did not reflect interactions between subsystems 
but were solely induced by the flnite length and the frequency content of the time 
series and by the way how networks are derived from empirical data. The length of 
time series (i.e., the number of data points) and the temporal sampling frequency 
determine the observation duration which has to be chosen such that it allows for 
a reliable identification of interactions between subsystems. This choice becomes 
non-trivial if typical time-scales of the dynamics are unknown a priori. In addi- 
tion, if pursuing a time-resolved analysis, to achieve a better temporal resolution, 
it is tempting to increase the sampling frequency while keeping the length of time 
series per window constant. If done irrespectively of the typical time scales of the 
studied dynamics (oversampling), this will likely yield time series with an artificially 
increased amount of temporal correlations reflected in slower decaying autocorrela- 
tion functions and, equivalently, in a larger amount of low-frequency contributions. 
These artificial temporal correlations can induce structures in interaction networks 
derived from the time series. Taken together, the question then arises as to how 
informative network analysis results are with respect to the studied dynamics. This 
question can be addressed by defining and making use of appropriate null models 
of which we discuss the most frequently employed ones in the following. 

Erdos-Renyi (ER) networks have found frequent use as null models in field stud- 
ies. We recall (cf. section l2.1.2[) that in ER networks, possible edges are equally likely 
and independently chosen to become edges. Using this null model, interaction net- 
works can be tested whether they comply with the notion of such random networks. 
In our interaction networks derived from time series generated by independent pro- 
cesses, possible edges are equally likely but not independently chosen to become 
edges, which can be deduced from the behaviour of the clustering coefficient (cf. 
section I5.1.1|) . We observed the clustering coefficient C, the average shortest path 



length L, and the assortativity coefficient a of our interaction networks to clearly 
differ from those of corresponding ER networks. A comparison of C and L to those 
of ER networks, as pursued in numerous field studies, will likely lead to a classifi- 
cation of our networks as small-world networks. Compared to ER networks, which 
are uncorrelated random networks, our networks are likely classified as assortative 
networks: the analysis methodology alone can readily induce degree-degree corre- 
lations which are, by construction, not present in ER networks (apart from effects 
due to the finite size of networks). Taken together, a comparison of properties of 
interaction networks with those of ER networks is likely to yield spurious find- 
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ings which are not related to the studied dynamics but to the way how interaction 
networks are derived from finite empirical data. Since the ER model does not ac- 
count for the latter, it may not be well suited as null model for interaction networks 
derived from multivariate time series. 

Another null model is based on randomization of a network topology while the 
degrees of nodes are preserved ll86llll2l[TT3l (cf . section I2.1.2[ generalized random 
graphs). We recall that this model can be used to test whether an interaction net- 
work under consideration is random under the constraint of a given degree se- 
quence. Although we did not directly investigate this model in this chapter, our 
findings allow us to draw substantial conclusions about its usefulness for inter- 
action networks derived from empirical data: the structures induced by the way 
how networks are derived from finite time series cannot be reflected in the de- 
gree sequence only. This result is based on the observation that C, L, and a pro- 
nouncedly depended on the finiteness of the data (length of time series T) while 
the degree distribution did not (cf. figure |5]6] (a-c), M = 1). This behaviour might 
be explained by degree-correlations which do not manifest themselves in the de- 
gree distribution. Indeed, it has been argued in the literature that the clustering 
coefficient and the average shortest path length can be influenced by degree-corre- 
lations [122[|123lll27[|129i . In this context, we observed the assortativity coefficient. 



which is indicative of degree-degree correlations in the network, to sensitively de- 
pend on the length of time series as well as on the amount of low-frequency contri- 
butions (cf . figure 15. 3|) . On the other hand, for a constant length of time series, we 
observed the degrees of nodes to be correlated with the relative amount of low-fre- 
quency contributions in the time series (as parametrized by M, cf. figure 15.61 (d)). 
Thus, we expect the degree distribution to at least partially reflect the frequency 
contents of the underlying time series. If our interaction networks were uncorre- 
cted (no degree-correlations), this finding would advocate the use of degree-pre- 
serving randomized networks as null model. Since our results clearly show that 
degree-degree correlations can already be induced by the analysis methodology 
applied to finite data, we consider degree-preserving randomization of networks, 
which yields — ^by construction — uncorrelated random networks, not well suited for 
serving as null model for interaction networks. This view is corroborated by a de- 
bate in which the usefulness of degree-preserving randomized networks as null 
model was questioned because they do not take into account different character- 
istics of the data and its acquisition II2371I238II . Finally we mention that the edge- 
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switching algorithm widely employed to generate degree-preserving random net- 
works is known to non-uniformly sample the space of networks with predefined 
degree sequence (see, e.g., references llllOUlllll '). Alternative randomization schemes 
have been proposed which can overcome this deficiency (see, e.g., Ill08lll09lllll| and 
references therein). 

We propose a null model which takes into account the way how networks are 
derived from empirical time series of finite length and of individual frequency con- 
tent. To this end, we apply the same analysis steps as in typical field data studies 
(estimation of signal interdependence, thresholding of interdependence values to 
derive edges) and use surrogates II2011I236II of the empirical time series to derive 
networks. These surrogate time series comply with the null hypothesis of indepen- 
dent linear stochastic processes and preserve length, amplitude distribution, and 
frequency content of the original time series (second random network ensemble in 
section 15.2. In our simulation studies, we observed C, L, and a of such networks 
to be higher the larger the amount of low-frequency contributions, the shorter the 
length of time series, and the smaller the edge density. Regarding the connectiv- 
ity condition, the minimum edge density e* for which a network is connected was 
higher the larger the amount of low-frequency contributions but appeared to be 
independent of the length of time series. The influence of the frequency content on 
the values of C, L, and (to a lesser extent) a was confirmed by results obtained from 
analyzing multichannel EEG recordings of 100 epileptic seizures. Findings reported 
in an earlier publication (cf. figure 2c in reference 11591 ) show that the minimum edge 
density e* increases at the end of the seizures where the relative amount of low- 
frequency contributions increases. This supports our findings obtained from the 
simulation studies. By comparing properties of interaction networks with those of 
our random networks, we were able to distinguish aspects of the network dynamics 
during seizures from those spuriously induced by the methods of analysis and by 
the finite length and spectral content of time series. 

Our findings are of particular relevance to numerous field data studies assessing 
and interpreting global as well as local characteristics of interaction networks. Our 
random networks are likely classified as small-world networks when comparing 
values of C and L with the ones of corresponding ER networks. This might indicate 
that the small-world characteristic of interaction networks derived from empirical 
data as reported in an ever increasing number of studies could partly or solely be 
related to the finite size and individual frequency contents of time series. In this 
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regard, our proposed null model can be of interest for studies in which short time 
series with large amount of low-frequency contributions are investigated, which is, 
for example, the case in resting state functional magnetic resonance imaging stud- 
ies (see, e.g., references ll65ll219ll239H242| V The same applies to studies assessing 
the assortativity of interaction networks (see, e.g., references H65]47TI '). Concerning 
local network characteristics, our observations of correlations between the degree 
of nodes and the relative amount of low-frequency contributions in the respective 
time series has important implications. The node degree has been frequently used 
to characterize the centrality of a node (see ||6l l243| and references therein) within 
a network and to identify hubs (nodes which are highly central). If findings of 
hubs could be partially or solely be attributed to the individual frequency contents 
of time series, hubs would be an overly complicated representation of features al- 
ready present on a single time series level. The same holds true for other network 
characteristics including the ones investigated here. We are confident that using 
our null model can help to unravel global as well as local network characteristics 
related to the studied dynamics from those spuriously induced by the finite length 
and the frequency contents of the time series and by the methods used to derive 
networks. 

Results of our field data analysis show that network characteristics depend also 
on the time series analysis method employed to infer edges. This dependence was 
intricately related to differences in frequency contents among time series: in the 
simulation studies, all time series were assumed to possess approximately the same 
frequency content (homogeneity assumption), whereas the frequency contents of 
time series of the seizure recordings can vary considerably among each other (het- 
erogeneity of spectral contents). In our simulation studies, network characteristics 
C, L, and a showed qualitatively the same dependence on the length of time se- 
ries, the amount of low-frequency contributions, and the edge density for networks 
based on thresholding absolute values of the correlation coefficient (p*^) or of the 
maximum cross correlation (p™), respectively. For the seizure recordings, if net- 
work construction was based on p*^, the dependence of these network character- 
istics on the relative amount of low-frequency contributions was qualitatively the 
same as in the simulation studies (see first random network model, section I5.2.1[) . 
This observation suggests that estimating the mean spectral content of empirical 
time series can help the experimentalist to tentatively assess the potential relative 
increase of C, L, and a in different networks based on p*^. This rule of thumb will 
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not be useful for networks based on p^, for which we observed a sensitive de- 
pendence on the heterogeneity of spectral contents of EEG time series (see second 
random network model, section |5.2.1[) . In regard to the latter, we consider future 
investigations promising that address the question, which aspects in the definition 
of jo'^ and exactly leads to the observed difference in the sensitive dependence 
on the heterogeneity of spectral contents. 

Conclusions can also be drawn for a network construction technique which relies 
on significance testing in order to derive edges II151I . For this method, null distri- 
butions of the estimator of signal interdependence (p^) are generated for each pair 
of time series. An edge is established if the null hypothesis of independent pro- 
cesses generating the time series can be rejected at a prespecified significance level. 
In order to reduce the computational burden for generating such null distributions, 
it was suggested to restrict the creation of null distributions to a limited subset of 
time series only II151II . However, our findings indicate that networks constructed 
this way will yield an artificially increased number of false positive or false nega- 
tive edges. This number will likely depend on the relative spectral contents of time 
series being part or not part of the subset. 

Finally we mention that our results might also be of value for network modeling. 
The simulation studies demonstrate that networks can be generated whose network 
characteristics C, L, and a are approximately equal but whose degree distributions 
and connectivity conditions differ. Such networks can be produced by choosing a 
threshold and generating time series obeying the relation between the size of the 
moving average and the length of time series. 

We close this chapter by summarizing its main contributions: first, we found that 
the finite length and the frequency content of time series together with the com- 
monly used methods to define edges can induce non-trivial structures in derived 
interaction networks. These structures do not necessarily reflect mutual interactions 
between subsystems and will likely lead to a classification of a network as small- 
world and assortative. Second, to distinguish network structures related to the dy- 
namics from those spuriously induced by the analysis methodology, we proposed a 
null model which incorporate knowledge about the way how interaction networks 
are derived from empirical data (second random network ensemble, section 15.2. 
Our approach is data-driven and yields random networks with non-trivial topolo- 
gies solely related to the methods of analysis, the finite amount of available data, 
and the spectral content of time series. It can be regarded as an instance of a general 
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framework which allows for the generation of random networks by implementing 
the null hypothesis already on the time series level. Third, to assess the relevance 
of our findings for field data analysis, we investigated multichannel EEG record- 
ings capturing 100 epileptic seizures which are known for their complex spatial 
and temporal dynamics. Results indicate that the pronounced changes of the fre- 
quency content during seizures are reflected in network properties. This influence 
sensitively depended on the chosen method to estimate signal interdependence. By 
using our null model, we were able to distinguish properties of interaction net- 
works related to seizure dynamics from those spuriously induced by the analysis 
methodology. Fourth, our findings open up the way to promising research direc- 
tions. For example, we restricted our investigations to frequently used network 
characteristics, but we expect also other network properties to be affected by the 
identified influences. Most of our results were based on numerical studies, but an- 
alytical approaches can be expected to complement our findings and advance the 
understanding of exact interrelationships between properties on the level of time 
series and properties of interaction networks. Moreover, our proposed framework 
for generating random networks can be extended or changed in various parts in 
order to meet different demands. This possibility allows one to study different 
network construction techniques other than thresholding (e.g., networks based on 
minimum spanning trees II153I or weighted networks), or different non-linear and 
linear methods for estimating signal interdependence Ill32[|133lll37i . Finally, em- 
ploying other surrogate concepts on the level of time series II244I42491 allows one to 
define different random networks which may prove useful for various purposes. 
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Concepts from network theory have been applied in various scientific disciplines 
and can advance our understanding of the dynamics of complex systems. Results 
obtained in an ever increasing number of field studies revealed richly structured 
topologies (including small-world characteristics and assortativity) of interaction 
networks derived from spatially extended systems. The inference of such networks 
is based on empirical data and relies on the spatial and temporal sampling of the 
dynamics. A key challenge of this approach and an inevitable prerequisite for the 
interpretation of results is to reliably assess whether characteristics of the interac- 
tion networks are significant or not and whether they indeed reflect properties of 
the dynamics. In this thesis, we investigated whether and how the spatial and tem- 
poral sampling of the dynamics together with commonly applied methods for edge 
inference influence the properties of interaction networks and affect the assessment 
of the significance of findings. In modeling and numerical studies, we identified 
factors which easily influence network properties and which are not related to 
the dynamics but to the spatial and temporal sampling together with the analy- 
sis methodology used to infer networks from empirical data. These findings were 
supported by results obtained from our field studies of brain functional networks. 
We developed and proposed strategies which can help to distinguish properties 
of interaction networks related to the dynamics from those spuriously induced by 
the identified influences. Our findings related to small-world characteristics and as- 
sortativity call for a careful reconsideration and reinterpretation of analysis results 
reported in earlier studies in diverse scientific fields. Moreover, our results indi- 
cate that also other network characteristics (such as centralities or communities) are 
affected by the identified influences. 

The network approach towards the analysis of the dynamics of complex systems 
comes along with several assumptions — often made implicitly — about what is in- 
teracting, how interaction takes place, and on which temporal and spatial scales 
the dynamics unfold. These assumptions manifest themselves in different ways, 
for example when deciding about the type and number of sensors and where to 
place them, or when choosing an observation duration and sampling frequency. On 
the network level, these assumptions translate into the challenges of how to iden- 
tify nodes and edges. Whereas these questions can be straightforwardly answered 
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for various systems (e.g., electric power grids), they pose a non-trivial challenge 
for many natural systems (e.g., in climate science, earth science, or in the neuro- 
sciences). 

The spatial sampling is crucial for the identification of nodes and edges of inter- 



action networks (cf. section |43] for an in-depth discussion). Since nodes are usually 
associated with sensors when inferring interaction networks, missing to sample the 
dynamics of a subsystem or accidentally sampling the dynamics of the same sub- 
system (i.e. a common source) with two or more sensors can remarkably change the 
topology of derived interaction networks. As we demonstrated (cf. section |4j2]), the 
presence of common sources leads to an artificial increase of the clustering coefficient 
if using commonly employed time series analysis techniques to infer edges. More- 
over, frequently used time series analysis techniques cannot distinguish between 
direct and indirect interactions, which represents an additional mechanism for an 
artificial increase of the clustering coefficient. If the data is contaminated with noise 
contributions, which is often unavoidable in empirical studies, the average shortest 
path length is likely to be artificially decreased due to uncertainties arising from 
the identification of edges. Taken together, this yields interaction networks which 
possess a small-world topology even if the actual underlying interaction structure 
is not small world (cf . section I4.2[) . Moreover, such interaction networks are prone 
to be classified as assortative networks even in cases in which the actual inter- 
action structure is dissortative (cf. section I4.2|) . We identified several strategies to 
approach the aforementioned issues. On the network level, data-driven node-merg- 
ing strategies II291I221II could account for "redundant" nodes which represent the 
same subsystem, and network characteristics could be developed which take into 
account spatial correlations present in the data II2051I226I . On the level of time series, 
some analysis techniques II2001I2071I208II (cf. section |4.3[) might be capable of dis- 
tinguishing between signal interdependencies due to interacting subsystems and 
those due to sampling a common source. Other techniques may be able to distin- 
guish between direct and indirect interactions Ill33il210l - I212ll227l - I229| . Finally, on 
the system level, an improved determination of the actual structural organization 
may help to design suitable sensor placement strategies. 

Improving the determination of the actual structural organization of a system 
may not be applicable in cases in which separate entities (subsystems) cannot 
be unambiguously defined. The network approach then superimposes a model 
on the data which does not necessarily match the organization of the underly- 
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ing system. For instance, if the system is characterized by a physical field (e.g. 
pressure, temperature, electric or magnetic field), a decomposition of the system 
into subsystems represents a coarse graining of the dynamics and may introduce 
spatial correlations in the topology of interaction networks. Care should be used 
(and awareness is already developing in some studies, see, for example, refer- 
ences 11391 [T99ll205ll2T9ll225ll250ti25a to ensure that assessed network characteris- 
tics do reflect properties of the dynamics and not properties solely arising from 
the applied coarse graining scheme (e.g. from the arrangement of sensors, cf. sec- 
tion I4.2[) . If the spatial sampling does not change during the acquisition of data, a 
time-resolved network analyses which strictly focusses on relative changes of net- 
work properties over time can represent an approach to exclude potential spatial 
sampling effects. While relating features of interaction networks to those of the un- 
derlying dynamics might still be challenging, the network approach can nonethe- 
less be used as a powerful tool to achieve information reduction when analyzing 
multivariate time series obtained from a multitude of sensors. 

The temporal sampling of the dynamics plays an important role for the identifi- 
cation of edges (cf. section |53l for a thorough discussion). In numerical studies (cf. 
section |5^ , we found that the finite length of time series (as determined by the choices 
of observation duration and sampling frequency) as well as the amount of low-fre- 
quency contributions can lead to spurious properties in derived interaction networks 
if frequently employed methods for edge identification (thresholding estimators of 
signal interdependence) are used. This even holds true in cases in which the system 
is appropriately spatially sampled and an unambiguous identification of nodes is 
possible. We investigated interaction networks that were derived from time series of 
independent stochastic processes. The latter would not advocate a representation 
by a network which is a model of mutual relationships. Remarkably, the result- 
ing interaction networks showed non-trivial structures which deviated from those 
of random (Erdos-Renyi) networks. This deviation was stronger the smaller the 
length of time series or the larger the amount of low-frequency contributions. Next 
to influences on the degree distribution and connectedness of networks, we found 
these networks to likely show small-world and assortative network characteristics. 
We consider these findings to be of particular interest for studies in which network 
inference is based on short time series (e.g. time-resolved network analyses aiming 
at high temporal resolutions or studies based on notoriously short time series such 
as fMRI or financial data). Different strategies can be pursued to address the afore- 
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mentioned issues. On the system level, an improved determination of the temporal 
scales on which the dynamics unfold may help to guide choices related to tempo- 
ral sampling schemes and subsequent steps of data analyses. On the time series 
level, significance testing using null distributions of the employed estimator of sig- 
nal interdependence for the inference of edges can help to control the probability of 
spurious edges II1511I2021I214I and may thus reduce spurious properties in derived 
interaction networks. On the network level, a comparison of interaction networks 
with those obtained from network null models that take into account how interac- 
tion networks are derived from empirical data can help to distinguish properties 
reflecting characteristics of the dynamics from those spuriously induced. We devel- 
oped such a network null model and demonstrated its usefulness when studying 
seizure dynamics in epilepsy patients (cf. section |5^ . 

Ensembles of random networks are typically employed as network null models to 
assess whether findings obtained by the network approach are significant or not (cf. 
section l53l for a detailed discussion). These models always encode an expectation of 
what can be assumed to be present "by chance". Most field studies rely on the very 
same random network ensembles, namely on degree-preserving randomized net- 
works or on Erdos-Renyi (ER) networks, and thus implicitly share the same "null" 
expectation (e.g. for ER networks: edges are equally likely and independently cho- 
sen to become edges). If one aims to interpret features of interaction networks and 
to gain a better understanding of the dynamics of spatially extended systems, our 
findings call for the development and use of more sophisticated null models which 
take into account the way (spatial sampling, temporal sampling, employed time se- 
ries analysis techniques and strategies towards edge inference) interaction networks 
are derived from the dynamics of the system. We demonstrated a basic network null 
model accounting for the spatial arrangement of sensors (cf . section 14.11 and refer- 
ence II254I ). Such models can be tailored to various applications (see reference 11225 II 
for an example in the neuroscience), and their further development can profit from 
research into spatial networks Il28ll . We proposed a framework to construct network 
null models which take into account the temporal sampling (finite length and fre- 
quency content of time series) as well as the applied methods for edge inference (cf. 
chapter |5] and reference 112321 ). Such network null models, which are currently used 
to study climate networks II255L may help to uncover previously hidden properties 
in interaction networks. 
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6 Conclusion 



We restricted our investigations to unweighted undirected networks, but we 
expect that the identified influences also leave an imprint on weighted and di- 
rected networks. The development of appropriate null models for such networks 
can be considered as promising and may profit from previous work (see refer- 
ences II256H258I and references therein). 

Recent years have undoubtedly seen tremendous success of the network ap- 
proach towards the analysis of the dynamics of complex systems. Currently, as 
the network approach matures, challenges increasingly become apparent in diverse 
scientific fields lll99ll225ll232limi25Tll25limi259ti2Ml and need to be met in order 
to avoid misinterpretations and to make progress. Such efforts promise to advance 
applied network science and can reward us with a far better characterization and 
deeper understanding of the dynamics of complex systems. 
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7 Appendix 

7.1 Identifying clusters in weighted networks 

We choose to identify clusters in weighted networks defined by their weight matri?^ 
W using an approach which is based on the concept of a random walk on the 
edge structure II971I98II . Such an approach is closely related to spectral clustering (see 
references II2621I263I for an overview, and references II2641I265I for early work in this 
area). The key idea is that nodes should belong to a cluster if the random walk 
stays long within the cluster and only seldom jumps to nodes not being part of the 
cluster. We define the transition probability matrix M of a Markov chain, 

M = WD-\ (7.1) 

with entries Wij > OVz, /, Wa = 1 Vz, and D is a diagonal matrix with entries djj = 
Wij. Mij represents the transition probability from node ; to i. A natural choice 
for a distance between nodes i and / in terms of transition probabilities would be to 
consider the vector distance between the z'th and the j'th column of M. Moreover, 
we can exploit the time evolution of the stochastic process by considering powers of 
M which allows us to explore the connectivity structure of nodes from a local to a 
global perspective Il98l . (M^),y with t > represents the transition probability from 
node j to z in T steps. Thus, we consider a weighted vector distance, the diffusion 
distance d^ M266I - I268II , between nodes z and 

N 

k=l 
N 

= L\^k\^"i^kt-Akj)\ (7.2) 
k=l 

where = E/,/ W* / E/ are the weights, A^, is the z'th component of the k'th 
normalized (E/^fa/Ci = 1) left eigenvector of M, and denote the corresponding 
eigenvalues (vi = 1 > \v2\ > ... > I^nI)- For t ^ oo, vanishes (|vjtp^ — ^ 
with k > 1, and An = IVz) representing a perspective in which all nodes belong 

^ Note that we assume all edges to exist, i.e., the adjacency matrix A. has entries Aij = l,i ^ j, and 
is zero else. 
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to a single cluster. In contrast, for t — ^ 0, becomes the identity matrix and 
increases for all pairs of nodes, which belongs to a perspective in which the network 
disintegrates into as many clusters as there are nodes. To identify a number q of 
clusters, we determine the corresponding time scale t = T{q) by requiring the 



{q + l)st eigenvalue to vanish, i.e.. 



^ where < ^ ^ 1 is a non-zero 



. Note 



small number (here we used ^ = 0.01), which leads to T{q) = In^/ In 
that equation (|7.2[) can be rewritten as Euclidean distance between vectors o{j) 
{\v]^\'^A]^j),k = 1,...,N associated with nodes If T{q) is chosen appropriately, 
contributions from terms k > q can be neglected and are zero for k = 1 since 
Ay = ivy. Thus, it is sufficient to consider Euclidean distances between "reduced" 
position vectors 

Ored(;) = (|Afc|Mfc^),fc = 2,...,^? (7.3) 

in a {q — 1) dimensional space only, which represents an effective dimensionality 
reduction. In this space, clusters are determined using the common k-means clus- 
tering algorithm f 269|| which is initialized with estimates of the cluster centers [97|. 
Partitions are determined for q = 1, . . . ,N, and the partition is chosen which maxi- 
mizes a quality function. We choose the modularity |9T| as quality function, because 
it has already been successfully used in different studies and its limitation have 
been thoroughly investigated Il96l . 



7.2 Duplication models 

Network models involving duplication processes have been studied in the context 
of gene duplication |270l l271| , which is considered a feature of biological evolu- 
tion. Many studies investigate protein-protein interaction networks, i.e., networks 
whose nodes are proteins (coded by genes) and whose edges represent binding in- 
teractions in a cell. The evolution process likely leaves an imprint in the topology 
of such networks via duplication, which is used in various modeling studies (see, 
e.g., II2721I273I ) and is exploited for analysis purposes B274I . 

We carry over concepts from duplication models in order to study the influence 
of common sources on the clustering coefficient, the average shortest path length, 
and the assortativity coefficient of interaction networks (cf . section 14.2. 2[) . Two dif- 
ferent duplication processes are considered. In the first model (cf . left column in fig- 
ure [T]!]), a node i is duplicated by introducing an additional node i'. i' is connected 
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(1) (2) 

OD ® CD ® 

I I 




Figure 7.1: Schematics showing the construction of (left column) and AQ^ (right 
column) out of M by duplication according to the first and second model, respec- 
tively. The exemplary network J\f consists of two nodes i and / which are connected 
(top row). The bottom row shows networks (left) and (right) derived from 
M. 

to all neighbours of i and, in addition, it is also connected to i (this corresponds to 
type B twins in |274 | | if nodes of arbitrary degrees are allowed). In the second model 
(cf . right column in figure \7.1} , the duplication of i introduces the duplicate node i' 
which is connected to the neighbours of i only (type A twins in II274I '). Let M denote 
some network of size N. In the following, we investigate properties of networks 
and A/J which are derived from M by applying the duplication process from the 
first model or the second model, respectively, to each single node of N. Note that 
Ml and Ml networks possess 2N nodes by construction. 

To simplify the notation, we refrain from introducing additional subindices or 
symbols to differently denote network characteristics of the two duplication models. 
Instead we report results obtained for the two models in separate paragraphs which 
allows one to distinguish between network properties of M^ or A/J networks. 

Clustering coefficient 

We recall the definition of the clustering coefficient. 
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where N denotes the number of nodes, 

C = I M^r^ ^^'^ ^ (7 5) 

' \ 0, iffc,G{0,l}. 

is the local clustering coefficient, kj denotes the degree of node i, and Aij is an 
entry of the adjacency matrix A. defining the network. Let Ej be the set of edges 
connecting neighbours of node i with each other, and let | E, | be the number of such 
edges. Note that 2|E;| = J^j^m •^ij'^inf^mi arid thus 

When considering a network J\fi or A/J derived by duplicating all nodes of the 
ancestor network J\f, \ E* \ denotes the number of edges between neighbours of node 
i in A/\* or A/'2*, and k* denotes the degree of node i in A/\* or A/'2*. 



First model. Note that |E* | = 4|E/| + 3ki and k* = 2ki + 1 for nodes i in M^. With 
equation (|7.6[) we obtain 

' it*(it* - 1) 2Jc, + 1 ^ {2k, + l)k, 2k, + 1^ ' {2k, + ly ^ ' 

which holds for k, > 0. For nodes i with k, = 0, the duplication produces isolated 
connected pairs of nodes, which results in C* = 0. Thus we obtain 



^ 0, if k, = 0. 



J I or if !<■ ^ 







Second model. Observe that |E*| = 4|E,| and fc? = 2ki. Thus, the local clustering 
coefficient of node / in reads 

fc*(fc*-l) 2k,{2k,-l) k*{k*-l) 'k,-\- 
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Average shortest path length 

The average shortest path length is given by 

^ = ^ = ^ E ^rj' (7-10) 

1^1 1^1 {t,j)es 

where 

S = {ihi) \ kj<oo; i,j = l,...,N} (7.11) 

denotes the set of ordered pairs of nodes for which a finite path of length 
exists, and Lg is the sum of the lengths of all shortest paths between these nodes. 
For the sake of brevity, we call | S \ the number of pairs of connected nodes in the 
following. 



First model. In order to derive L* of a network Af^, we consider the sum Lg of 
shortest paths in M-^. Afi is composed of two "layers". The first layer consists of the 
ancestor nodes while the second layer consists of the duplicate nodes derived from 
the ancestors. The sum of shortest paths within each layer is L5. Let us first neglect 
the edge between each ancestor and its duplicate node. Then, the sum of shortest 
paths established via all other edges between both layers will amount to 2L5. We 
now consider the edges between each ancestor and its duplicate node only, whose 
sum of shortest paths amounts to 2N since we treat the shortest path from node i 
to j and from node ; to i separately (see equations (|7.10[) and (|7.11|) ). Thus, 



LI = 4Ls + 2N. (7.12) 

Via the same line of reasoning, we obtain the number of pairs of connected nodes 
in at;, 

|S*|=4|S|. (7.13) 

Note that the number of pairs of connected nodes within each layer amounts to 
\S\ and contains self -connections of nodes (la = by definition). The remaining 
number of pairs of connected nodes 2|S| accounts for the paths between both lay- 
ers including the path between each ancestor node and its duplicate node. Using 
equations (|7.12[) and (|7.13|) we get 



LI N 

T7r^ = L+—-. (7.14) 
S* 2S ^ ' 
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Second model. In order to derive the average shortest path length of M^, we need 
to define the number of nodes without neighbours in M , 

NQ = \{i\k,=Q,i = l,...,N}\. (7.15) 

Following our line of reasoning presented above, we consider network J\f2 to be 
composed of two layers, the first containing nodes of N and the second containing 
all the duplicate nodes. The sum of shortest paths within each layer amounts to 
L5. Edges between both layers establish additional shortest paths whose sum is 
composed of two parts. The first part amounts to 2Ls and reflects all shortest paths 
between nodes of the two different layers excluding the path between each ancestor 
node and its duplicate node. The second part reflects the shortest paths between 
ancestor nodes i and their duplicate nodes i' . Note that shortest paths between i 
and i' only exist if ki > in J\f. If such a shortest path exists, its length must be 
liii = 2 due to the construction of Af-^- Taking into account that we distinguish 
between paths from i to i' and from i' to i, the second part amounts to 4(N — Nq). 
Thus we obtain 

=4Ls + 4(N-No). (7.16) 

To derive the number of pairs of connected nodes in A/"2*, we consider equation 
(|7.13|) . Note that the number of pairs of connected nodes where i and ; belong 
to different layers may be smaller than 2|S|. This is because nodes with no neigh- 
bours in A/" do not possess a connecting path to their duplicate nodes in Af-^ ■ The 
number of pairs of connected nodes in J\f2 thus reads 

|S*| =4|S| -2No. (7.17) 

The average shortest path length of is then given by 

L* = ^ = LiL + L2, (7.18) 

where 
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Assortativity coefficient 

Consider the set E of edges of a given network, and denote with Ig and mg the 
degrees of nodes at either end of edge e G E. We briefly recall the definition of the 
assortativity coefficient which is defined as the correlation coefficient (corr) between 
the degrees of nodes at the end of edges, 

, Cov(Z, m) Cov(l, m) 

a := corr /,m = = \ 7.20 

mm Var(l) 

where Cov(Z, m) denotes the covariance between the degrees of nodes at either 
end of edges, and ctj and Var(l) denote the standard deviation and variance of the 
degrees of nodes at one end of edges, respectively. The second equality in equa- 
tion (|7.20|) holds only for undirected networks since (T; = (r,« in such cases. 



We begin with collecting some facts. Let kj be the degree of node i and let N 
denote the number of nodes of the network. For the number | £ | of edges we obtain 

N 

|E| = E^- (7.21) 



i=l 



Furthermore, we observe that 



N N 
'.2 V" ;2 _ V" 7.3 



eEE i=l eeE i=l 



where I denotes the mean of the degrees of nodes at one end of the edges. Using 
these equations, it is straightforward to show that 



Eili ki 



Var(0 = Vn . ' (7-23) 



and 




Cov(/,m) = -^E^^'^^-(^l^l • (7-24) 

hi=l l^i eeE 



First model. Observe that the number of edges within the network is |E*| = 
4|£| + 2N (we treat each undirected edge as two directed ones) and the number 
of nodes is N* = 2N. Let /* denote the degree of a node at one end of edge e in 
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Ml - Note that each node i in Af has a new degree in A/\*, k* = Ikj + 1, and that its 
duplicate i' has the same degree k*, = k* . Let the node indices be ordered such that 
i G {1, . . .,N} are the ancestor nodes and i G {N + 1, . . .,2N} are the duplicate 
nodes. Thus, we can rewrite Ej^(fc* )' = 2 E?{k*Y for any value of s G {1,2,3}. By 
making use of these observations and equation (|7.23|) we obtain 



Var(/^ 



E£i(2fc, + 1)3 - (E£i(2fc. + 1)2)' / E£i(2fc. + 1) 



E£i(2fc. + i: 



(7.25) 



To derive the covariance Cov(/*,m*), we use 



N 



eeE* 



eeE 



i=l 



N 



4 E(2I, + l)(2m, + 1) + 2 E(2fc, + if 

eeE i=\ 

N N N 

16 J]/,m, + 16Efc2+4Efc, + 2E(2fc, + l) 



(7.26) 



We can eliminate term EeeE ^e^e by using equation (|7.24[) and thus we obtain 



Cov(/*,m*) = ( y I 

2E£i(2fc. + l) Lt^* 



1 



E£i(2fc, + 1) 

N 



N 



Er=i(2fe, + 1) 

E£i(2fc,+i: 

N 



8Cov(l,m)U]fc, +8 



N 



+ J](8/c2 + 2fc, + (2fc, + i: 

! = 1 



Lli{2k, + 1) 
Lli{2k, + 1] 



(7.27) 



With equations (17:2511 , (17:27b . and (17:20b we finally obtain 

, Cov(r,m*) 
' = Var(Z*) =^i^ + ^2 

Efc?-(Efc?)VEfc. 



with 



ai := 8 



E(2fc, + 1)3 - (E(2fc, + 1)2)2/ ^^2k, + 1] 



(7.28) 



(7.29) 



7.3 Proofs 
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and 



_ {8{Lk^){l + LkU + 2 Efc. + E(2fc, + 1)2 - ^^^^ 

+ 1)3 - (E(2/c, + 1)2)2/ ^(2fc. + 1) ' -' (^-30) 

where a denotes the assortativity coefficient of M. 



Second model. Let Ig and ing denote the degree of the nodes at either end of an 
edge e. Observe that the number of edges |E*| of A/J is four times the number of 
edge |E| in M, \E*\ = 4|E|. Each edge in J\f is represented by four edges in A/'2*, 
where the latter share all the same degrees at their ends. Moreover, for the degrees 
of nodes i in J\f2 holds k* = Ikj which carries over to the degrees of nodes at an 
end of an edge, I* = lie- Thus, 

E Z>* = 4 E />* = i:(2/*)(2m*) = E(4Z.)(4me). (7.31) 

eeE* eeE eeE eeE 

Therefore, the correlation coefficient of A/"2* can be expressed by 

a* = corr(Z*,m*) = corr(4Z,4m) = corr(l,m) = a, (7.32) 

where the third equality follows from the fact that the correlation coefficient is 
invariant to changes of scale of the variables (except for a sign). 



7.3 Proofs 



For the sake of completeness, the proofs needed in chapter |5] are presented. All con- 
tent of this section was kindly provided by Martin Wendler, University of Bochum, 
Germany, and was published in reference II232I . 



Lemma 1 

For every i,j G {1, . . . , N} with i ^ j, we have the following limit of the probability 
distribution of the empirical correlation: 



T \ 2 11 

-corr (x^,M,T,Xj^M,T) <x] ^ ^{x) with g{M) = -M+ -— (7.33) 



g(M) ^'^"'^^ ^ / ^ 3 ' 3M 



as T ^ oo, where O denotes the cumulative distribution function of a standard 
normal random variable. 
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Proof. In order to simplify the presentation, we write i//,M,r(0 = ^iMji^) ~ h 
so that Ey, a4,t(0 = 0- First note that i/f,M,r(0 ^ M-dependent sequence, i.e., for 
|s — ^1 > M, yi,M,T{s) and yi^Mji^) are independent. So we have that the covariance 

Cov (i//,M,T(l)y/,M,T(l)/!//,M,T(Oy/,M,T(0) =0 for T > M. 
Additionally, 

Cov (y;,M,T(l)y;,M,T(l),yi,M,T(0y7,M,T(0) = 

Cov (y/,M,r(l),yi,M,T(0) Cov (yj,M,T(l),y/,M,T(0) (7-34) 

and Cov (Z;(s),z,(f)) = Var (Zj(l)) if s = ^ and otherwise Cov (z/(s),Z/(f)) = 0. For 
1 < i < M, we obtain by the definition of the moving average and the independence 
of the underlying process Zj{t), t G N that 

I (M-{t-\) \2 

Cov(y,;M,T(l)y/,M,T(l),y!,M,T(Oy/,M,T(0) = I E ^ar (zy(s)) j (7.35) 

= ]^(^^-(^-l))'Var2(z.(l)). 
By the central limit theorem for M-dependent random variables, see reference M275I , 

T 



1 1 ^ 



Var (^1 ELi y,M,r(Oy;,M,T(i^ ^ 

converges in distribution to a standard normal random variable as T — ^ oo. Further- 
more, we have the following convergence for the variance as T — > oo: 

1 ^ 



TVar Ey':M,T(i)y7,M,T(0 

M 

Var(y^,M,T(l)y7,M,T(l)) + 2 J] Cov (yi,M,T(l)y;,M,T(l)/yf,M,T(Oy;,M,T(0) 
= [^ + ^, - - 1))') Var2 (z,(l)) = ^Var2 (z.(l)) . (7.37) 



7.3 Proofs 
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The last equality follows easily by Yli^i = "^"^"^^^^^""'"'^^ . With the same central limit 
theorem, A= YJ=\ Vi M t(0 converges to a normal limit, so -\- Ylt=i Vi M r(0 ^ in 
probability and consequently 



— Ly^MM — LviMAt) ^ o (7-38) 

,T4 t=i J t=l ) 

in probability as T — > cx). By similar arguments, we have that \ — > 
Var(i/,;M,T(1)) = iiVar (z;(l)) and ^ ELi 3/f,M,T(0 ^ 0, so we get 

1 ^ 1 ^ /i ^ V 

f=i t=i V '=1 / 

^ Var(y,M,T(l)) = ^Var (z,(l)) . (7.39) 



By Slutsky's theorem Il276l and with (|736)) . (l737|) , (|738)) . and (l739ll . we finally obtain 
that 



r 



^^^COrr(x,;M,T/ ^;,M,t) 

giM)^Zj=i{yi,M,Ti^) -yiM,TViLj=iiyiM,Tit) -yjMjV 



(7.40) 



converges in distribution to a standard normal random variable as T — > oo. This 
completes the proof. 



Lemma 2 



9 



For T ^ CO, R ^ CO 
in probability with reff(M) = 



,M,T ^2<D(-e) 



(7.41) 
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Proof. With Lemma 1, we have that 



H 



= P corr (x/,m,T/^;,m,t) > 



P PijMJ > 



^/3k(My 



PiiM,T >^]+p(\l <-e]^ 2^i-0) 



as T — > 00. Furthermore, H^^^]^ j, is bounded by and 1, so Var (^H^^^]^ < 5. By 
the independence of the R random networks 



Var e 



-.,M,T 



as R — )■ cx). The lemma follows with the Chebyshev inequality. 



< — ^ 
- 4R 
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